(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 
International Bureau 

(43) International Publication Date 
15 February 2001 (15.02.2001) 




PCT 



(10) International Publication Number 

WO 01/10903 A2 



(51) International Patent Classification 7 : C07K 14/00 

(21) International Application Number: PCT/US00/21878 

(22) International Filing Date: 9 August 2000 (09.08.2000) 

(25) Filing Language: English 

(26) Publication Language: English 

(30) Priority Data: 

60/147,986 9 August 1999 (09.08.1999) US

60/160,807 21 October 1999 (21.10.1999) US

(63) Related by continuation (CON) or continuation-in-part 
(CIP) to earlier applications: 

US 60/147,986 (CIP) 

Filed on 9 August 1999 (09.08.1999)

US 60/160,807 (CIP)

Filed on 21 October 1999 (21.10.1999)

(71) Applicant (for all designated States except US): INCYTE
GENOMICS, INC. [US/US]; 3160 Porter Drive, Palo
Alto, CA 94304 (US).

(72) Inventors; and 

(75) Inventors/Applicants (for US only): YUE, Henry
[US/US]; 826 Lois Avenue, Sunnyvale, CA 94087 (US).
LAL, Preeti [IN/US]; 2382 Lass Drive, Santa Clara, CA
95054 (US). TANG, Y. Tom [CN/US]; 4230 Ranwick
Court, San Jose, CA 95118 (US). BANDMAN, Olga
[US/US]; 366 Anna Avenue, Mountain View, CA 94043
(US). BAUGHN, Mariah, R. [US/US]; 14244 Santiago
Road, San Leandro, CA 94577 (US). AZIMZAI, Yalda
[US/US]; 2045 Rock Springs Drive, Hayward, CA 94545
(US). LU, Dyung, Aina, M. [US/US]; 55 Park Belmont
Place, San Jose, CA 95136 (US). YANG, Junming
[CN/US]; 7125 Bark Lane, San Jose, CA 95129 (US).

(74) Agents: HAMLET-COX, Diana et al.; Incyte Genomics, 
Inc., 3160 Porter Drive, Palo Alto, CA 94304 (US). 

(81) Designated States (national): AE, AG, AL, AM, AT, AU,
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CR, CU, CZ,
DE, DK, DM, DZ, EE, ES, FI, GB, GD, GE, GH, GM, HR,
HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK, LR,
LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, MX, MZ,
NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, TJ, TM,
TR, TT, TZ, UA, UG, US, UZ, VN, YU, ZA, ZW.

(84) Designated States (regional): ARIPO patent (GH, GM,
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European
patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE,
IT, LU, MC, NL, PT, SE), OAPI patent (BF, BJ, CF, CG,
CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG).

Published: 

— Without international search report and to be republished 
upon receipt of that report. 

For two-letter codes and other abbreviations, refer to the "Guidance
Notes on Codes and Abbreviations" appearing at the beginning
of each regular issue of the PCT Gazette.






(54) Title: PROTEASES AND PROTEASE INHIBITORS 

(57) Abstract: The invention provides human proteases and protease inhibitors (PPIM) and polynucleotides which identify and 
encode PPIM. The invention also provides expression vectors, host cells, antibodies, agonists, and antagonists. The invention also 
provides methods for diagnosing, treating, or preventing disorders associated with expression of PPIM. 






PROTEASES AND PROTEASE INHIBITORS 

TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences of proteases and protease
inhibitors and to the use of these sequences in the diagnosis, treatment, and prevention of cell
proliferative and autoimmune/inflammatory disorders.

BACKGROUND OF THE INVENTION 

Proteolytic processing is an essential component of normal cell growth, differentiation,
remodeling, and homeostasis. The cleavage of peptide bonds within cells is necessary for the maturation
of precursor proteins to their active forms, the removal of signal sequences from targeted proteins, the
degradation of incorrectly folded proteins, and the controlled turnover of peptides within the cell.
Proteases participate in apoptosis, inflammation, and tissue remodeling during embryonic development,
wound healing, and normal growth. They are necessary components of bacterial, parasitic, and viral
invasion and replication within a host. Four principal categories of mammalian proteases have been
identified based on active site structure, mechanism of action, and overall three-dimensional structure.
(See Beynon, R.J. and J.S. Bond (1994) Proteolytic Enzymes: A Practical Approach, Oxford University
Press, New York NY, pp. 1-5.)

The serine proteases (SPs) are a large family of proteolytic enzymes that include the digestive
enzymes, trypsin and chymotrypsin; components of the complement cascade and of the blood-clotting
cascade; and enzymes that control the degradation and turnover of macromolecules of the extracellular
matrix. SPs are so named because of the presence of a serine residue found in the active catalytic site for
protein cleavage. The active site of all SPs is composed of a triad of residues including the
aforementioned serine, an aspartate, and a histidine residue. SPs have a wide range of substrate
specificities and can be subdivided into subfamilies on the basis of these specificities. The main
subfamilies are trypases which cleave after arginine or lysine; aspases which cleave after aspartate;
chymases which cleave after phenylalanine or leucine; metases which cleave after methionine; and
serases which cleave after serine. Clp protease is a unique member of the serine protease family as its
activity is controlled by a regulatory subunit that binds and hydrolyzes ATP. Clp protease was
originally found in plant chloroplasts but is believed to be widespread in both prokaryotic and eukaryotic
cells (Maurizi, M.R. et al. (1990) J. Biol. Chem. 265:12546-12552). SKD3, a mammalian homolog of
the bacterial Clp regulatory subunit, has recently been identified in mouse (Perier, F. et al. (1995) Gene
152:157-163).
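
By way of illustration only, the subfamily specificities listed above can be expressed as a simple lookup. The following Python sketch is not part of the disclosure; the names SP_SUBFAMILY_SPECIFICITY and find_cleavage_sites are hypothetical, and the sketch considers only the single residue preceding the scissile bond, ignoring the surrounding sequence context that also influences cleavage by real enzymes.

```python
# Illustrative sketch only. The residue preferences are taken from the
# paragraph above; the identifiers are hypothetical.
SP_SUBFAMILY_SPECIFICITY = {
    "trypase": {"R", "K"},  # cleave after arginine or lysine
    "aspase": {"D"},        # cleave after aspartate
    "chymase": {"F", "L"},  # cleave after phenylalanine or leucine
    "metase": {"M"},        # cleave after methionine
    "serase": {"S"},        # cleave after serine
}


def find_cleavage_sites(peptide, subfamily):
    """Return 1-based positions of residues after which cleavage could occur.

    The C-terminal residue is skipped because no peptide bond follows it.
    """
    residues = SP_SUBFAMILY_SPECIFICITY[subfamily]
    peptide = peptide.upper()
    return [i + 1 for i, aa in enumerate(peptide[:-1]) if aa in residues]


if __name__ == "__main__":
    # Arbitrary example peptide in one-letter code (an assumption, not a PPIM sequence).
    for name in SP_SUBFAMILY_SPECIFICITY:
        print(name, find_cleavage_sites("AKRGDFM", name))
```

The lookup keeps the classification data separate from the scanning logic, so additional specificities could be added without changing the function.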

Cysteine proteases are involved in diverse cellular processes ranging from the processing of
precursor proteins to intracellular degradation. Mammalian cysteine proteases include lysosomal
cathepsins and cytosolic calcium activated proteases, calpains. Of particular note, cysteine proteases are
produced by monocytes, macrophages and other cells of the immune system which migrate to sites of
inflammation and in their protective role secrete various molecules to repair damaged tissue. These cells
may overproduce the same molecules and cause tissue destruction in certain disorders. In autoimmune
diseases such as rheumatoid arthritis, the secretion of the cysteine protease, cathepsin C, degrades
collagen, laminin, elastin and other structural proteins found in the extracellular matrix of bones. The
cathepsin family of lysosomal proteases includes the cysteine proteases: cathepsins B, H, K, L, O2, and
S; and the aspartyl proteases: cathepsins D and G. Various members of this endosomal protease family
are differentially expressed. Some, such as cathepsin D, have a ubiquitous tissue distribution while
others, such as cathepsin L, are found only in monocytes, macrophages, and other cells of the immune
system.

Aspartic proteases include bacterial penicillopepsin, mammalian pepsin, renin, chymosin, and
certain fungal proteases. The characteristic active site residues of aspartic proteases are a pair of
aspartic acid residues, for example, Asp33 and Asp213 in penicillopepsin. Aspartic proteases are also
called acid proteases because the optimum pH for their activity is between 2 and 3. In this pH range, one
of the aspartate residues is ionized and the other is neutral. A potent inhibitor of aspartic proteases is the
hexapeptide pepstatin which, in the transition state, resembles normal substrates.

Carboxypeptidases A and B are the principal mammalian representatives of the metallo-protease
family. Both are exopeptidases of similar structure and active site configuration. Carboxypeptidase A,
like chymotrypsin, prefers C-terminal aromatic and aliphatic side chains of hydrophobic nature, whereas
carboxypeptidase B is directed toward basic arginine and lysine residues. Active site components
include zinc, which coordinates two glutamic acid and one histidine residues in the protein.

Ubiquitin proteases are associated with the ubiquitin conjugation system (UCS), a major
pathway for the degradation of cellular proteins in eukaryotic cells and some bacteria. The UCS
mediates the elimination of abnormal proteins and regulates the half-lives of important regulatory
proteins that control cellular processes such as gene transcription and cell cycle progression. In the UCS
pathway, proteins targeted for degradation are conjugated to ubiquitin, a small heat stable protein. The
ubiquitinated protein is then recognized and degraded by the proteasome, a large, multisubunit proteolytic
enzyme complex, and ubiquitin is released for reutilization by ubiquitin protease. The UCS is implicated
in the degradation of mitotic cyclic kinases, oncoproteins, tumor suppressor genes such as p53, viral
proteins, cell surface receptors associated with signal transduction, transcriptional regulators, and
mutated or damaged proteins (Ciechanover, A. (1994) Cell 79:13-21). A murine proto-oncogene, Unp,
encodes a nuclear ubiquitin protease whose overexpression leads to oncogenic transformation of NIH3T3
cells, and the human homolog of this gene is consistently elevated in small cell tumors and
adenocarcinomas of the lung (Gray, D.A. (1995) Oncogene 10:2179-2183).

Protease inhibitors and other regulators of protease activity control the activity and effects of
proteases. Protease inhibitors have been shown to control pathogenesis in animal models of proteolytic
disorders (Murphy, G. (1991) Agents Actions Suppl. 35:69-76). Low levels of the cystatins, low
molecular weight inhibitors of the cysteine proteases, correlate with malignant progression of tumors
(Calkins, C. et al. (1995) Biol. Chem. Hoppe Seyler 376:71-80).

The plasma inter-α-trypsin inhibitor family molecules are serine protease inhibitors (serpins)
composed of a 240 kDa plasma protein complex of at least five different types of glycoproteins. These
glycoproteins consist of four heavy (H) chains and one 30 kDa light (L) chain named H1, H2, H3, H4,
and L, and are independently synthesized and proteolytically processed from precursor proteins (Daveau,
M. et al. (1998) Arch. Biochem. Biophys. 350:315-323; and Salier, J.P. et al. (1992) Mamm. Genome
2:233-239). The plasma inter-α-trypsin inhibitor light chains have sequence similarity to the Kunitz
trypsin inhibitors which appear to be present in all vertebrates (Salier, J.P. (1990) Trends Biochem. Sci.
15:435-439). Some examples of the Kunitz trypsin inhibitors are tissue factor pathway inhibitor, which
regulates tissue factor-induced coagulation, and protease nexin-2, which regulates serum coagulation
factor XIa (Broze, G.J. (1995) Annu. Rev. Med. 46:103-112; and Wagner, S.L. et al. (1993) Brain Res.
626:90-98). The heavy chain precursors encode a signal peptide sequence and the mature chain. Other
plasma inter-α-trypsin inhibitor heavy chains have been described in human and rodents (Bourguignon, J.
et al. (1993) Eur. J. Biochem. 212:771-776; Salier, 1992, supra; and Salier, J.P. (1996) Biochem. J.
315:1-9). The expression of the rat plasma inter-α-trypsin inhibitor genes is regulated by inflammation
in vivo. The genes are predominantly expressed in the rat liver, but H2 and H3 mRNA is also present in
brain, intestine, and stomach (Daveau, supra).

Kallistatins are members of the serine protease inhibitor family. Kallistatin forms a specific and
covalently-linked complex with tissue kallikrein, which is a serine proteinase capable of cleaving
kininogen to release vasoactive kinin. Components of the tissue kallikrein-kinin system include tissue
kallikrein, kallistatin, kininogen, kinin, bradykinin B1 and B2 receptors, and kininases (Chao, J. and L.
Chao (1995) Biol. Chem. Hoppe Seyler 376:705-713).

Proteases and protease inhibitory molecules may contain amino acid sequence motifs which
determine protein-protein interactions, such as the potential metal-binding site of von Willebrand factor
type A3 (vWFA3) motif, glycine-amino acid-serine-amino acid-serine. This motif is also required for
ligand interaction in the homologous I-type domains of integrins CR3 and LFA-1 (Huizinga, E.G. (1997)
Structure 5:1147-1156).

Protease inhibitors play a major role in the regulation of the activity and effect of proteases.
They have been shown to control pathogenesis in animal models of proteolytic disorders and in the
treatment of HIV (Murphy, G. (1991) Agents Actions Suppl. 35:69-76; and Pakyz, A. and D. Israel
(1997) J. Am. Pharm. Assoc. (Wash.) NS37:543-551).




The discovery of new proteases and protease inhibitors and the polynucleotides encoding them
satisfies a need in the art by providing new compositions which are useful in the diagnosis, prevention,
and treatment of cell proliferative and autoimmune/inflammatory disorders.

SUMMARY OF THE INVENTION

The invention features purified polypeptides, proteases and protease inhibitors, referred to
collectively as "PPIM" and individually as "PPIM-1," "PPIM-2," "PPIM-3," "PPIM-4," "PPIM-5,"
"PPIM-6," "PPIM-7," "PPIM-8," "PPIM-9," "PPIM-10," "PPIM-11," "PPIM-12," "PPIM-13,"
"PPIM-14," "PPIM-15," "PPIM-16," "PPIM-17," "PPIM-18," "PPIM-19," "PPIM-20," "PPIM-21,"
"PPIM-22," "PPIM-23," "PPIM-24," "PPIM-25," "PPIM-26," and "PPIM-27." In one aspect, the
invention provides an isolated polypeptide comprising an amino acid sequence selected from the group
consisting of a) an amino acid sequence selected from the group consisting of SEQ ID NO:1-27, b) a
naturally occurring amino acid sequence having at least 90% sequence identity to an amino acid
sequence selected from the group consisting of SEQ ID NO:1-27, c) a biologically active fragment of an
amino acid sequence selected from the group consisting of SEQ ID NO:1-27, and d) an immunogenic
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:1-27. In one
alternative, the invention provides an isolated polypeptide comprising the amino acid sequence of SEQ
ID NO:1-27.
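
For illustration, the "at least 90% sequence identity" criterion recited above can be sketched as a calculation over two sequences that have already been aligned. The Python sketch below is an assumption-laden simplification and not the method of the invention: the function names are hypothetical, the alignment step itself is not shown, and the choice of denominator (all aligned columns other than gap-to-gap columns) is only one of several conventions; the tools and threshold parameters actually used for analysis are those described in Table 5.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity between two pre-aligned sequences of equal length.

    Columns in which both sequences carry a gap are ignored. A column with a
    gap in only one sequence counts as a mismatch. This denominator is an
    assumption; other conventions exist.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap and b == gap:
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


def meets_identity_threshold(aligned_a, aligned_b, threshold=90.0):
    """True if the aligned pair reaches the stated identity threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Short made-up sequences, not taken from SEQ ID NO:1-27.
    print(percent_identity("ACDEFGHIKL", "ACDEFGHIKV"))
    print(meets_identity_threshold("ACDEFGHIKL", "ACDEFGHIKV"))
```

The same function could be applied with a threshold of 70.0 to aligned polynucleotide sequences, the figure recited for SEQ ID NO:28-54.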

The invention further provides an isolated polynucleotide encoding a polypeptide comprising an
amino acid sequence selected from the group consisting of a) an amino acid sequence selected from the
group consisting of SEQ ID NO:1-27, b) a naturally occurring amino acid sequence having at least 90%
sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:1-27, c) a
biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID
NO:1-27, and d) an immunogenic fragment of an amino acid sequence selected from the group consisting
of SEQ ID NO:1-27. In one alternative, the polynucleotide encodes a polypeptide selected from the
group consisting of SEQ ID NO:1-27. In another alternative, the polynucleotide is selected from the
group consisting of SEQ ID NO:28-54.

Additionally, the invention provides a recombinant polynucleotide comprising a promoter
sequence operably linked to a polynucleotide encoding a polypeptide comprising an amino acid sequence
selected from the group consisting of a) an amino acid sequence selected from the group consisting of
SEQ ID NO:1-27, b) a naturally occurring amino acid sequence having at least 90% sequence identity to
an amino acid sequence selected from the group consisting of SEQ ID NO:1-27, c) a biologically active
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:1-27, and d) an
immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID
NO:1-27. In one alternative, the invention provides a cell transformed with the recombinant
polynucleotide. In another alternative, the invention provides a transgenic organism comprising the
recombinant polynucleotide.

The invention also provides a method for producing a polypeptide comprising an amino acid
sequence selected from the group consisting of a) an amino acid sequence selected from the group
consisting of SEQ ID NO:1-27, b) a naturally occurring amino acid sequence having at least 90%
sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:1-27, c) a
biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID
NO:1-27, and d) an immunogenic fragment of an amino acid sequence selected from the group consisting
of SEQ ID NO:1-27. The method comprises a) culturing a cell under conditions suitable for expression
of the polypeptide, wherein said cell is transformed with a recombinant polynucleotide comprising a
promoter sequence operably linked to a polynucleotide encoding the polypeptide, and b) recovering the
polypeptide so expressed.

Additionally, the invention provides an isolated antibody which specifically binds to a
polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino acid
sequence selected from the group consisting of SEQ ID NO:1-27, b) a naturally occurring amino acid
sequence having at least 90% sequence identity to an amino acid sequence selected from the group
consisting of SEQ ID NO:1-27, c) a biologically active fragment of an amino acid sequence selected
from the group consisting of SEQ ID NO:1-27, and d) an immunogenic fragment of an amino acid
sequence selected from the group consisting of SEQ ID NO:1-27.

The invention further provides an isolated polynucleotide comprising a polynucleotide sequence
selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of
SEQ ID NO:28-54, b) a naturally occurring polynucleotide sequence having at least 70% sequence
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:28-54, c) a
polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to b), and e)
an RNA equivalent of a)-d). In one alternative, the polynucleotide comprises at least 60 contiguous
nucleotides.
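
The relationships among a polynucleotide sequence, its complement, and its RNA equivalent recited in items a) through e) above can be sketched in Python as follows. This is an illustrative sketch only. It assumes that "complementary" denotes the reverse complement read 5' to 3', and that an RNA equivalent has the same linear sequence with uracil in place of thymine; these definitions and all function names are assumptions made for the example, not quotations from this section.

```python
# Translation table for Watson-Crick pairing of DNA bases (case preserved).
_DNA_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna):
    """Return the complementary strand, written 5' to 3'."""
    return dna.translate(_DNA_COMPLEMENT)[::-1]


def rna_equivalent(dna):
    """Return the same linear sequence with thymine replaced by uracil."""
    return dna.replace("T", "U").replace("t", "u")


def contiguous_fragments(sequence, length):
    """Return every run of `length` contiguous nucleotides in `sequence`."""
    return [sequence[i:i + length] for i in range(len(sequence) - length + 1)]


if __name__ == "__main__":
    # Made-up sequence, not taken from SEQ ID NO:28-54.
    example = "ATGGCCATTGTAATGGGCCGC"
    print(reverse_complement(example))
    print(rna_equivalent(example))
    print(len(contiguous_fragments(example, 20)))
```

For a sequence shorter than the requested length, contiguous_fragments returns an empty list, which mirrors the fact that such a sequence contains no fragment of, for example, 60 contiguous nucleotides.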

Additionally, the invention provides a method for detecting a target polynucleotide in a sample,
said target polynucleotide having a sequence of a polynucleotide comprising a polynucleotide sequence
selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of
SEQ ID NO:28-54, b) a naturally occurring polynucleotide sequence having at least 70% sequence
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:28-54, c) a
polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to b), and e)
an RNA equivalent of a)-d). The method comprises a) hybridizing the sample with a probe comprising
at least 20 contiguous nucleotides comprising a sequence complementary to said target polynucleotide in
the sample, and which probe specifically hybridizes to said target polynucleotide, under conditions
whereby a hybridization complex is formed between said probe and said target polynucleotide or
fragments thereof, and b) detecting the presence or absence of said hybridization complex, and
optionally, if present, the amount thereof. In one alternative, the probe comprises at least 60 contiguous
nucleotides.
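
As a rough computational analogy to the hybridization step described above, the sketch below locates the positions in a target sequence to which a probe of at least 20 contiguous nucleotides is perfectly complementary. It is a deliberate simplification offered under stated assumptions: real hybridization depends on stringency conditions and tolerates some mismatches, whereas this sketch requires an exact match. The function name probe_binding_sites and the min_length parameter are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def probe_binding_sites(target, probe, min_length=20):
    """Return 0-based start positions in `target` where `probe` can pair exactly.

    The probe must contain at least `min_length` nucleotides. Matching is
    exact (no mismatches), which is an assumption made for simplicity.
    """
    if len(probe) < min_length:
        raise ValueError("probe is shorter than the minimum length")
    # The stretch of target that pairs with the probe is the probe's
    # reverse complement.
    site = probe.upper().translate(_COMPLEMENT)[::-1]
    target = target.upper()
    hits = []
    start = target.find(site)
    while start != -1:
        hits.append(start)
        start = target.find(site, start + 1)
    return hits


if __name__ == "__main__":
    # Made-up target and probe, not taken from SEQ ID NO:28-54.
    site = "ATGGCCATTGTAATGGGCCGC"
    target = "TT" + site + "AA"
    probe = site.translate(_COMPLEMENT)[::-1]
    print(probe_binding_sites(target, probe))
```

An empty result corresponds to the absence of a detectable complex in this simplified model; the number of hits is not a measure of the amount of complex formed.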

The invention further provides a method for detecting a target polynucleotide in a sample, said
target polynucleotide having a sequence of a polynucleotide comprising a polynucleotide sequence
selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of
SEQ ID NO:28-54, b) a naturally occurring polynucleotide sequence having at least 70% sequence
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:28-54, c) a
polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to b), and e)
an RNA equivalent of a)-d). The method comprises a) amplifying said target polynucleotide or fragment
thereof using polymerase chain reaction amplification, and b) detecting the presence or absence of said
amplified target polynucleotide or fragment thereof, and, optionally, if present, the amount thereof.

The invention further provides a composition comprising an effective amount of a polypeptide
comprising an amino acid sequence selected from the group consisting of a) an amino acid sequence
selected from the group consisting of SEQ ID NO:1-27, b) a naturally occurring amino acid sequence
having at least 90% sequence identity to an amino acid sequence selected from the group consisting of
SEQ ID NO:1-27, c) a biologically active fragment of an amino acid sequence selected from the group
consisting of SEQ ID NO:1-27, and d) an immunogenic fragment of an amino acid sequence selected
from the group consisting of SEQ ID NO:1-27, and a pharmaceutically acceptable excipient. In one
embodiment, the composition comprises an amino acid sequence selected from the group consisting of
SEQ ID NO:1-27. The invention additionally provides a method of treating a disease or condition
associated with decreased expression of functional PPIM, comprising administering to a patient in need
of such treatment the composition.

The invention also provides a method for screening a compound for effectiveness as an agonist
of a polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino
acid sequence selected from the group consisting of SEQ ID NO:1-27, b) a naturally occurring amino
acid sequence having at least 90% sequence identity to an amino acid sequence selected from the group
consisting of SEQ ID NO:1-27, c) a biologically active fragment of an amino acid sequence selected
from the group consisting of SEQ ID NO:1-27, and d) an immunogenic fragment of an amino acid
sequence selected from the group consisting of SEQ ID NO:1-27. The method comprises a) exposing a
sample comprising the polypeptide to a compound, and b) detecting agonist activity in the sample. In
one alternative, the invention provides a composition comprising an agonist compound identified by the
method and a pharmaceutically acceptable excipient. In another alternative, the invention provides a
method of treating a disease or condition associated with decreased expression of functional PPIM,
comprising administering to a patient in need of such treatment the composition.

Additionally, the invention provides a method for screening a compound for effectiveness as an
antagonist of a polypeptide comprising an amino acid sequence selected from the group consisting of a)
an amino acid sequence selected from the group consisting of SEQ ID NO:1-27, b) a naturally occurring
amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from the
group consisting of SEQ ID NO:1-27, c) a biologically active fragment of an amino acid sequence
selected from the group consisting of SEQ ID NO:1-27, and d) an immunogenic fragment of an amino
acid sequence selected from the group consisting of SEQ ID NO:1-27. The method comprises a)
exposing a sample comprising the polypeptide to a compound, and b) detecting antagonist activity in
the sample. In one alternative, the invention provides a composition comprising an antagonist
compound identified by the method and a pharmaceutically acceptable excipient. In another
alternative, the invention provides a method of treating a disease or condition associated with
overexpression of functional PPIM, comprising administering to a patient in need of such treatment the
composition.

The invention further provides a method of screening for a compound that specifically binds to
a polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino acid
sequence selected from the group consisting of SEQ ID NO:1-27, b) a naturally occurring amino acid
sequence having at least 90% sequence identity to an amino acid sequence selected from the group
consisting of SEQ ID NO:1-27, c) a biologically active fragment of an amino acid sequence selected
from the group consisting of SEQ ID NO:1-27, and d) an immunogenic fragment of an amino acid
sequence selected from the group consisting of SEQ ID NO:1-27. The method comprises a) combining
the polypeptide with at least one test compound under suitable conditions, and b) detecting binding of
the polypeptide to the test compound, thereby identifying a compound that specifically binds to the
polypeptide.

The invention further provides a method of screening for a compound that modulates the
activity of a polypeptide comprising an amino acid sequence selected from the group consisting of a)
an amino acid sequence selected from the group consisting of SEQ ID NO:1-27, b) a naturally
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence
selected from the group consisting of SEQ ID NO:1-27, c) a biologically active fragment of an amino
acid sequence selected from the group consisting of SEQ ID NO:1-27, and d) an immunogenic
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:1-27. The
method comprises a) combining the polypeptide with at least one test compound under conditions
permissive for the activity of the polypeptide, b) assessing the activity of the polypeptide in the
presence of the test compound, and c) comparing the activity of the polypeptide in the presence of the
test compound with the activity of the polypeptide in the absence of the test compound, wherein a
change in the activity of the polypeptide in the presence of the test compound is indicative of a
compound that modulates the activity of the polypeptide.
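
The comparison in step c) above can be sketched numerically as a ratio of activity measured with and without the test compound. The following Python sketch is purely illustrative. The function name classify_modulation and, in particular, the tolerance parameter are assumptions introduced for the example: the text above speaks only of "a change" in activity and does not specify how large a change must be, so any cutoff would have to be chosen with regard to assay variability.

```python
def classify_modulation(activity_with, activity_without, tolerance=0.1):
    """Classify the effect of a test compound on measured polypeptide activity.

    `tolerance` is a hypothetical fractional band around 1.0 within which the
    ratio of the two activities is treated as unchanged.
    """
    if activity_without <= 0:
        raise ValueError("activity without compound must be positive")
    ratio = activity_with / activity_without
    if ratio > 1 + tolerance:
        return "increased"
    if ratio < 1 - tolerance:
        return "decreased"
    return "unchanged"


if __name__ == "__main__":
    # Made-up activity values in arbitrary units.
    print(classify_modulation(150.0, 100.0))
    print(classify_modulation(50.0, 100.0))
    print(classify_modulation(105.0, 100.0))
```

Under this sketch, a result of "increased" or "decreased" would flag the compound as a candidate modulator for further testing, and "unchanged" would not.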

The invention further provides a method for screening a compound for effectiveness in altering
expression of a target polynucleotide, wherein said target polynucleotide comprises a sequence selected
from the group consisting of SEQ ID NO:28-54, the method comprising a) exposing a sample
comprising the target polynucleotide to a compound, and b) detecting altered expression of the target
polynucleotide.

The invention further provides a method for assessing toxicity of a test compound, said
method comprising a) treating a biological sample containing nucleic acids with the test compound; b)
hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20
contiguous nucleotides of a polynucleotide comprising a polynucleotide sequence selected from the
group consisting of i) a polynucleotide sequence selected from the group consisting of SEQ ID
NO:28-54, ii) a naturally occurring polynucleotide sequence having at least 70% sequence identity to a
polynucleotide sequence selected from the group consisting of SEQ ID NO:28-54, iii) a polynucleotide
sequence complementary to i), iv) a polynucleotide sequence complementary to ii), and v) an RNA
equivalent of i)-iv). Hybridization occurs under conditions whereby a specific hybridization complex
is formed between said probe and a target polynucleotide in the biological sample, said target
polynucleotide comprising a polynucleotide sequence selected from the group consisting of i) a
polynucleotide sequence selected from the group consisting of SEQ ID NO:28-54, ii) a naturally
occurring polynucleotide sequence having at least 70% sequence identity to a polynucleotide sequence
selected from the group consisting of SEQ ID NO:28-54, iii) a polynucleotide sequence
complementary to i), iv) a polynucleotide sequence complementary to ii), and v) an RNA equivalent of
i)-iv). Alternatively, the target polynucleotide comprises a fragment of a polynucleotide sequence
selected from the group consisting of i)-v) above; c) quantifying the amount of hybridization complex;
and d) comparing the amount of hybridization complex in the treated biological sample with the
amount of hybridization complex in an untreated biological sample, wherein a difference in the amount
of hybridization complex in the treated biological sample is indicative of toxicity of the test compound.



BRIEF DESCRIPTION OF THE TABLES

Table 1 shows polypeptide and nucleotide sequence identification numbers (SEQ ID NOs), clone 
identification numbers (clone IDs), cDNA libraries, and cDNA fragments used to assemble full-length 
sequences encoding PPIM. 

Table 2 shows features of each polypeptide sequence, including potential motifs, homologous
sequences, and methods, algorithms, and searchable databases used for analysis of PPIM.

Table 3 shows selected fragments of each nucleic acid sequence; the tissue-specific expression
patterns of each nucleic acid sequence as determined by northern analysis; diseases, disorders, or
conditions associated with these tissues; and the vector into which each cDNA was cloned.

Table 4 describes the tissues used to construct the cDNA libraries from which cDNA clones
encoding PPIM were isolated.

Table 5 shows the tools, programs, and algorithms used to analyze the polynucleotides and 
polypeptides of the invention, along with applicable descriptions, references, and threshold parameters. 

DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is understood
that this invention is not limited to the particular machines, materials and methods described, as these
may vary. It is also to be understood that the terminology used herein is for the purpose of describing
particular embodiments only, and is not intended to limit the scope of the present invention which will be
limited only by the appended claims.

It must be noted that as used herein and in the appended claims, the singular forms "a," "an,"
and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, a
reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a
reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so forth.

Unless defined otherwise, all technical and scientific terms used herein have the same meanings
as commonly understood by one of ordinary skill in the art to which this invention belongs. Although
any machines, materials, and methods similar or equivalent to those described herein can be used to
practice or test the present invention, the preferred machines, materials and methods are now described.
All publications mentioned herein are cited for the purpose of describing and disclosing the cell lines,
protocols, reagents and vectors which are reported in the publications and which might be used in
connection with the invention. Nothing herein is to be construed as an admission that the invention is not
entitled to antedate such disclosure by virtue of prior invention.

DEFINITIONS

"PPIM" refers to the amino acid sequences of substantially purified PPIM obtained from any 
species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and human, 
and from any source, whether natural, synthetic, semi-synthetic, or recombinant.

The term "agonist" refers to a molecule which intensifies or mimics the biological activity of 
PPIM. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 
compound or composition which modulates the activity of PPIM either by directly interacting with PPIM 
or by acting on components of the biological pathway in which PPIM participates. 

An "allelic variant" is an alternative form of the gene encoding PPIM. Allelic variants may
result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in
polypeptides whose structure or function may or may not be altered. A gene may have none, one, or
many allelic variants of its naturally occurring form. Common mutational changes which give rise to
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides.
Each of these types of changes may occur alone, or in combination with the others, one or more times in
a given sequence. 

"Altered" nucleic acid sequences encoding PPIM include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as PPIM or a 
polypeptide with at least one functional characteristic of PPIM. Included within this definition are
polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe of
the polynucleotide encoding PPIM, and improper or unexpected hybridization to allelic variants, with a 
locus other than the normal chromosomal locus for the polynucleotide sequence encoding PPIM. The 
encoded protein may also be "altered," and may contain deletions, insertions, or substitutions of amino 
acid residues which produce a silent change and result in a functionally equivalent PPIM. Deliberate
amino acid substitutions may be made on the basis of similarity in polarity, charge, solubility,
hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues, as long as the biological or
immunological activity of PPIM is retained. For example, negatively charged amino acids may include 
aspartic acid and glutamic acid, and positively charged amino acids may include lysine and arginine. 
Amino acids with uncharged polar side chains having similar hydrophilicity values may include:
asparagine and glutamine; and serine and threonine. Amino acids with uncharged side chains having
similar hydrophilicity values may include: leucine, isoleucine, and valine; glycine and alanine; and 
phenylalanine and tyrosine. 

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic
molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally occurring
protein molecule, "amino acid sequence" and like terms are not meant to limit the amino acid sequence to
the complete native amino acid sequence associated with the recited protein molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid sequence. 
Amplification is generally carried out using polymerase chain reaction (PCR) technologies well known in
the art.

The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity of 
PPIM. Antagonists may include proteins such as antibodies, nucleic acids, carbohydrates, small 
molecules, or any other compound or composition which modulates the activity of PPIM either by 
directly interacting with PPIM or by acting on components of the biological pathway in which PPIM 
participates.

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments thereof,
such as Fab, F(ab')2, and Fv fragments, which are capable of binding an epitopic determinant.
Antibodies that bind PPIM polypeptides can be prepared using intact polypeptides or using fragments
containing small peptides of interest as the immunizing antigen. The polypeptide or oligopeptide used to
immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of RNA, or
synthesized chemically, and can be conjugated to a carrier protein if desired. Commonly used carriers 
that are chemically coupled to peptides include bovine serum albumin, thyroglobulin, and keyhole limpet 
hemocyanin (KLH). The coupled peptide is then used to immunize the animal. 

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that makes
contact with a particular antibody. When a protein or a fragment of a protein is used to immunize a host
animal, numerous regions of the protein may induce the production of antibodies which bind specifically 
to antigenic determinants (particular regions or three-dimensional structures on the protein). An 
antigenic determinant may compete with the intact antigen (i.e., the immunogen used to elicit the immune 
response) for binding to an antibody.

The term "antisense" refers to any composition capable of base-pairing with the "sense" (coding)
strand of a specific nucleic acid sequence. Antisense compositions may include DNA; RNA; peptide
nucleic acid (PNA); oligonucleotides having modified backbone linkages such as phosphorothioates, 
methylphosphonates, or benzylphosphonates; oligonucleotides having modified sugar groups such as
2'-methoxyethyl sugars or 2'-methoxyethoxy sugars; or oligonucleotides having modified bases such as
5-methyl cytosine, 2'-deoxyuracil, or 7-deaza-2'-deoxyguanosine. Antisense molecules may be produced
by any method including chemical synthesis or transcription. Once introduced into a cell, the
complementary antisense molecule base-pairs with a naturally occurring nucleic acid sequence produced
by the cell to form duplexes which block either transcription or translation. The designation "negative"
or "minus" can refer to the antisense strand, and the designation "positive" or "plus" can refer to the
sense strand of a reference DNA molecule.

The term "biologically active" refers to a protein having structural, regulatory, or biochemical 
functions of a naturally occurring molecule. Likewise, "immunologically active" or "immunogenic" 
refers to the capability of the natural, recombinant, or synthetic PPIM, or of any oligopeptide thereof, to 
induce a specific immune response in appropriate animals or cells and to bind with specific antibodies.

"Complementary" describes the relationship between two single-stranded nucleic acid sequences
that anneal by base-pairing. For example, 5'-AGT-3' pairs with its complement,
3'-TCA-5'.
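
The pairing rule in the example above can be illustrated with a short Python sketch. The helper names are hypothetical and only the four standard DNA bases are handled; this is a minimal illustration, not part of the specification.

```python
# Watson-Crick pairing for the four standard DNA bases (illustrative helper).
PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}

def complement_3_to_5(seq_5_to_3: str) -> str:
    """Return the complementary strand, written base-by-base in 3'->5' order."""
    return "".join(PAIRS[base] for base in seq_5_to_3.upper())

def reverse_complement(seq_5_to_3: str) -> str:
    """Return the same complementary strand rewritten in 5'->3' order."""
    return complement_3_to_5(seq_5_to_3)[::-1]

print(complement_3_to_5("AGT"))   # TCA  (i.e. 3'-TCA-5')
print(reverse_complement("AGT"))  # ACT  (i.e. 5'-ACT-3')
```

Both outputs describe the same strand; only the direction in which it is written differs.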

A "composition comprising a given polynucleotide sequence" and a "composition comprising a 
given amino acid sequence" refer broadly to any composition containing the given polynucleotide or 
amino acid sequence. The composition may comprise a dry formulation or an aqueous solution.
Compositions comprising polynucleotide sequences encoding PPIM or fragments of PPIM may be
employed as hybridization probes. The probes may be stored in freeze-dried form and may be associated
with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be deployed in an
aqueous solution containing salts (e.g., NaCl), detergents (e.g., sodium dodecyl sulfate; SDS), and other
components (e.g., Denhardt's solution, dry milk, salmon sperm DNA, etc.).

"Consensus sequence" refers to a nucleic acid sequence which has been subjected to repeated 
DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (PE Biosystems, 
Foster City CA) in the 5' and/or the 3' direction, and resequenced, or which has been assembled from one 
or more overlapping cDNA, EST, or genomic DNA fragments using a computer program for fragment
assembly, such as the GELVIEW fragment assembly system (GCG, Madison WI) or Phrap (University
of Washington, Seattle WA). Some sequences have been both extended and assembled to produce the 
consensus sequence.

"Conservative amino acid substitutions" are those substitutions that are predicted to interfere least
with the properties of the original protein, i.e., the structure and especially the function of the
protein is conserved and not significantly changed by such substitutions. The table below shows amino
acids which may be substituted for an original amino acid in a protein and which are regarded as 
conservative amino acid substitutions. 





Original Residue    Conservative Substitution

Ala                 Gly, Ser
Arg                 His, Lys
Asn                 Asp, Gln, His
Asp                 Asn, Glu
Cys                 Ala, Ser
Gln                 Asn, Glu, His
Glu                 Asp, Gln, His
Gly                 Ala
His                 Asn, Arg, Gln, Glu
Ile                 Leu, Val
Leu                 Ile, Val
Lys                 Arg, Gln, Glu
Met                 Leu, Ile
Phe                 His, Met, Leu, Trp, Tyr
Ser                 Cys, Thr
Thr                 Ser, Val
Trp                 Phe, Tyr
Tyr                 His, Phe, Trp
Val                 Ile, Leu, Thr
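
As an illustration, the table can be encoded as a lookup keyed by the original residue. The sketch below is in Python and uses the standard three-letter residue codes; the dictionary and function names are hypothetical. The table as written is directional (for example, Ala is listed for Cys, but Cys is not listed for Ala), and the sketch preserves that.

```python
# Conservative substitutions, transcribed from the table in the specification.
# Keys are original residues; values are the substitutions listed for them.
CONSERVATIVE_SUBSTITUTIONS = {
    "Ala": {"Gly", "Ser"},
    "Arg": {"His", "Lys"},
    "Asn": {"Asp", "Gln", "His"},
    "Asp": {"Asn", "Glu"},
    "Cys": {"Ala", "Ser"},
    "Gln": {"Asn", "Glu", "His"},
    "Glu": {"Asp", "Gln", "His"},
    "Gly": {"Ala"},
    "His": {"Asn", "Arg", "Gln", "Glu"},
    "Ile": {"Leu", "Val"},
    "Leu": {"Ile", "Val"},
    "Lys": {"Arg", "Gln", "Glu"},
    "Met": {"Leu", "Ile"},
    "Phe": {"His", "Met", "Leu", "Trp", "Tyr"},
    "Ser": {"Cys", "Thr"},
    "Thr": {"Ser", "Val"},
    "Trp": {"Phe", "Tyr"},
    "Tyr": {"His", "Phe", "Trp"},
    "Val": {"Ile", "Leu", "Thr"},
}

def is_conservative(original: str, substitute: str) -> bool:
    """True if `substitute` is listed for `original` in the table."""
    return substitute in CONSERVATIVE_SUBSTITUTIONS.get(original, set())

print(is_conservative("Cys", "Ala"))  # True
print(is_conservative("Ala", "Cys"))  # False: lookup follows the table's direction
```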



Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide
backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation,
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of the
side chain.

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to a chemically modified polynucleotide or polypeptide. Chemical 
modifications of a polynucleotide sequence can include, for example, replacement of hydrogen by an
alkyl, acyl, hydroxyl, or amino group. A derivative polynucleotide encodes a polypeptide which retains 
at least one biological or immunological function of the natural molecule. A derivative polypeptide is 
one modified by glycosylation, pegylation, or any similar process that retains at least one biological or 
immunological function of the polypeptide from which it was derived.

A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a
measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide.

A "fragment" is a unique portion of PPIM or the polynucleotide encoding PPIM which is 
identical in sequence to but shorter in length than the parent sequence. A fragment may comprise up to 
the entire length of the defined sequence, minus one nucleotide/amino acid residue. For example, a
fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A fragment
used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5, 10, 15, 
16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino acid 
residues in length. Fragments may be preferentially selected from certain regions of a molecule. For 
example, a polypeptide fragment may comprise a certain length of contiguous amino acids selected
from the first 250 or 500 amino acids (or first 25% or 50% of a polypeptide) as shown in a certain
defined sequence. Clearly these lengths are exemplary, and any length that is supported by the 
specification, including the Sequence Listing, tables, and figures, may be encompassed by the present 
embodiments. 
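
The structural part of this definition (a contiguous stretch identical to, but shorter than, the parent sequence) can be sketched in Python as follows. The function name and the default minimum length of 5 are assumptions taken from the exemplary lengths above, and the check does not test whether a fragment is "unique" in any genome.

```python
def is_fragment(candidate: str, parent: str, min_len: int = 5) -> bool:
    """True if `candidate` is a contiguous run of `parent`, at least `min_len`
    residues long and at least one residue shorter than `parent`."""
    return min_len <= len(candidate) < len(parent) and candidate in parent

parent = "TTACGTAGG"
print(is_fragment("ACGTA", parent))  # True: contiguous and shorter than the parent
print(is_fragment(parent, parent))   # False: a fragment must be shorter
print(is_fragment("ACG", parent))    # False: below the illustrative minimum length
```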

A fragment of SEQ ID NO:28-54 comprises a region of unique polynucleotide sequence that
specifically identifies SEQ ID NO:28-54, for example, as distinct from any other sequence in the
genome from which the fragment was obtained. A fragment of SEQ ID NO:28-54 is useful, for
example, in hybridization and amplification technologies and in analogous methods that distinguish
SEQ ID NO:28-54 from related polynucleotide sequences. The precise length of a fragment of SEQ ID
NO:28-54 and the region of SEQ ID NO:28-54 to which the fragment corresponds are routinely
determinable by one of ordinary skill in the art based on the intended purpose for the fragment.

A fragment of SEQ ID NO:1-27 is encoded by a fragment of SEQ ID NO:28-54. A fragment
of SEQ ID NO:1-27 comprises a region of unique amino acid sequence that specifically identifies SEQ
ID NO:1-27. For example, a fragment of SEQ ID NO:1-27 is useful as an immunogenic peptide for
the development of antibodies that specifically recognize SEQ ID NO:1-27. The precise length of a
fragment of SEQ ID NO:1-27 and the region of SEQ ID NO:1-27 to which the fragment corresponds
are routinely determinable by one of ordinary skill in the art based on the intended purpose for the
fragment.

A "full-length" polynucleotide sequence is one containing at least a translation initiation codon 
(e.g., methionine) followed by an open reading frame and a translation termination codon. A
"full-length" polynucleotide sequence encodes a "full-length" polypeptide sequence.
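
A minimal Python sketch of this check is given below. It assumes the input is a DNA coding region already trimmed of untranslated sequence, that ATG is the initiation codon, and that the standard stop codons TAA, TAG, and TGA apply; the function name is hypothetical.

```python
# Standard DNA stop codons (assumption: standard genetic code).
STOP_CODONS = {"TAA", "TAG", "TGA"}

def is_full_length(coding_dna: str) -> bool:
    """True if the sequence starts with ATG, ends with a stop codon,
    and has an uninterrupted in-frame open reading frame in between."""
    seq = coding_dna.upper()
    if len(seq) < 6 or len(seq) % 3 != 0:
        return False
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    has_start = codons[0] == "ATG"
    has_stop = codons[-1] in STOP_CODONS
    no_internal_stop = not any(c in STOP_CODONS for c in codons[1:-1])
    return has_start and has_stop and no_internal_stop

print(is_full_length("ATGGCCTAA"))     # True
print(is_full_length("ATGTAAGCCTAA"))  # False: in-frame stop before the end
print(is_full_length("GCCGCCTAA"))     # False: no initiation codon
```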

"Homology" refers to sequence similarity or, interchangeably, sequence identity, between two or 
more polynucleotide sequences or two or more polypeptide sequences. 

The terms "percent identity" and "% identity," as applied to polynucleotide sequences, refer to 
the percentage of residue matches between at least two polynucleotide sequences aligned using a
standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps in
the sequences being compared in order to optimize alignment between two sequences, and therefore 
achieve a more meaningful comparison of the two sequences. 
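
To make the idea concrete, the following Python sketch computes percent identity for two sequences that have already been aligned, with "-" marking inserted gaps. The choice of denominator (every alignment column, including gapped ones) is an assumption; alignment programs differ on this point, and the sketch is not the CLUSTAL V or BLAST calculation.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percentage of alignment columns in which both sequences carry the
    same residue. Inputs must be pre-aligned and of equal length."""
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("sequences must be aligned, equal-length, and non-empty")
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)

# One gap inserted in the first sequence; 4 of 6 columns match.
print(round(percent_identity("ACGT-A", "ACCTGA"), 1))  # 66.7
```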

Percent identity between polynucleotide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e sequence
alignment program. This program is part of the LASERGENE software package, a suite of molecular
biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in Higgins, D.G. 
and P.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) CABIOS 8:189-191. For 
pairwise alignments of polynucleotide sequences, the default parameters are set as follows: Ktuple=2, 
gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" residue weight table is selected as
the default. Percent identity is reported by CLUSTAL V as the "percent similarity" between aligned
polynucleotide sequences. 

Alternatively, a suite of commonly used and freely available sequence comparison algorithms is 
provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment Search 
Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), which is available from several
sources, including the NCBI, Bethesda, MD, and on the Internet at
http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence analysis
programs including "blastn," that is used to align a known polynucleotide sequence with other 
polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2
Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/bl2.html. The
"BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST programs
are commonly used with gap and other parameters set to default settings. For example, to compare two
nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 2.0.12
(April-21-2000) set at default parameters. Such default parameters may be, for example:

Matrix: BLOSUM62
Reward for match: 1
Penalty for mismatch: -2
Open Gap: 5 and Extension Gap: 2 penalties
Gap x drop-off: 50
Expect: 10
Word Size: 11
Filter: on
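
The following Python sketch shows how the match, mismatch, and gap values listed above could combine into a raw score for one given alignment. It assumes the affine convention in which a gap of length k costs the open penalty plus k times the extension penalty. The function name is hypothetical, and this is not the BLAST implementation, which also performs seeding, extension, and statistical evaluation.

```python
def alignment_score(aligned_a: str, aligned_b: str, reward: int = 1,
                    penalty: int = -2, gap_open: int = 5,
                    gap_extend: int = 2) -> int:
    """Raw score of a pre-aligned nucleotide pair ('-' marks a gap)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to equal length")
    score = 0
    gap_side = None  # which sequence the current run of gap columns is in
    for x, y in zip(aligned_a, aligned_b):
        side = "a" if x == "-" else "b" if y == "-" else None
        if side is None:
            score += reward if x == y else penalty
        else:
            if side != gap_side:
                score -= gap_open   # charged once per gap
            score -= gap_extend     # charged for every gap column
        gap_side = side
    return score

# 4 matches (+4), 1 mismatch (-2), one gap of length 1 (-(5 + 2)) gives -5.
print(alignment_score("ACGT-A", "ACCTGA"))  # -5
```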

Percent identity may be measured over the length of an entire defined sequence, for example, as 
defined by a particular SEQ ID number, or may be measured over a shorter length, for example, over the
length of a fragment taken from a larger, defined sequence, for instance, a fragment of at least 20, at
least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous nucleotides. Such 
lengths are exemplary only, and it is understood that any fragment length supported by the sequences 
shown herein, in the tables, figures, or Sequence Listing, may be used to describe a length over which 
percentage identity may be measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes in 
a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid sequences 
that all encode substantially the same protein. 
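
A small worked example in Python illustrates this degeneracy. The codon table is deliberately partial, covering only the six codons used, and the two sequences are invented for illustration. They encode the same tripeptide while agreeing at only 2 of 9 nucleotide positions.

```python
# Partial codon table (standard genetic code); only the codons used below.
CODONS = {
    "CTT": "Leu", "TTG": "Leu",
    "TCT": "Ser", "AGC": "Ser",
    "CGT": "Arg", "AGA": "Arg",
}

def translate(dna: str) -> list:
    """Translate an in-frame DNA string codon by codon."""
    return [CODONS[dna[i:i + 3]] for i in range(0, len(dna), 3)]

seq_a = "CTTTCTCGT"  # CTT TCT CGT
seq_b = "TTGAGCAGA"  # TTG AGC AGA

identical_positions = sum(x == y for x, y in zip(seq_a, seq_b))

print(translate(seq_a))                      # ['Leu', 'Ser', 'Arg']
print(translate(seq_a) == translate(seq_b))  # True
print(identical_positions, "of", len(seq_a)) # 2 of 9
```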

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to the
percentage of residue matches between at least two polypeptide sequences aligned using a standardized
algorithm. Methods of polypeptide sequence alignment are well-known. Some alignment methods take 
into account conservative amino acid substitutions. Such conservative substitutions, explained in more 
detail above, generally preserve the charge and hydrophobicity at the site of substitution, thus preserving 
the structure (and therefore function) of the polypeptide. 

Percent identity between polypeptide sequences may be determined using the default parameters
of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e sequence alignment
program (described and referenced above). For pairwise alignments of polypeptide sequences using 
CLUSTAL V, the default parameters are set as follows: Ktuple=1, gap penalty=3, window=5, and
"diagonals saved"=5. The PAM250 matrix is selected as the default residue weight table. As with
polynucleotide alignments, the percent identity is reported by CLUSTAL V as the "percent similarity"
between aligned polypeptide sequence pairs. 

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 
comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 2.0.12
(Apr-21-2000) with blastp set at default parameters. Such default parameters may be, for example:

Matrix: BLOSUM62
Open Gap: 11 and Extension Gap: 1 penalties
Gap x drop-off: 50
Expect: 10
Word Size: 3
Filter: on

Percent identity may be measured over the length of an entire defined polypeptide sequence, for
example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for instance, a
fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 150
contiguous residues. Such lengths are exemplary only, and it is understood that any fragment length
supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be used to 
describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the elements required for
chromosome replication, segregation and maintenance.

The term "humanized antibody" refers to an antibody molecule in which the amino acid sequence 
in the non-antigen binding regions has been altered so that the antibody more closely resembles a human 
antibody, and still retains its original binding ability.

"Hybridization" refers to the process by which a polynucleotide strand anneals with a
complementary strand through base pairing under defined hybridization conditions. Specific
hybridization is an indication that two nucleic acid sequences share a high degree of complementarity.
Specific hybridization complexes form under permissive annealing conditions and remain hybridized 
after the "washing" step(s). The washing step(s) is particularly important in determining the stringency 
of the hybridization process, with more stringent conditions allowing less non-specific binding, i.e.,
binding between pairs of nucleic acid strands that are not perfectly matched. Permissive conditions for
annealing of nucleic acid sequences are routinely determinable by one of ordinary skill in the art and may 
be consistent among hybridization experiments, whereas wash conditions may be varied among 
experiments to achieve the desired stringency, and therefore hybridization specificity. Permissive 
annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC, about 1% (w/v)
SDS, and about 100 μg/ml sheared, denatured salmon sperm DNA.

Generally, stringency of hybridization is expressed, in part, with reference to the temperature 
under which the wash step is carried out. Such wash temperatures are typically selected to be about 5°C 
to 20°C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength
and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of the target
sequence hybridizes to a perfectly matched probe. An equation for calculating Tm and conditions for
nucleic acid hybridization are well known and can be found in Sambrook, J. et al., 1989, Molecular
Cloning: A Laboratory Manual, 2nd ed., vol. 1-3, Cold Spring Harbor Press, Plainview NY; specifically
see volume 2, chapter 9.
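
As a hedged illustration, the Python sketch below uses one widely quoted empirical approximation for the Tm of long DNA:DNA hybrids and then derives the 5°C to 20°C wash window described above. The specification does not reproduce an equation, so the formula, its coefficients, and the function names are assumptions that should be checked against the cited manual before use.

```python
import math

def melting_temperature(na_molar: float, gc_percent: float,
                        length_bp: int, formamide_percent: float = 0.0) -> float:
    """Approximate Tm in degrees C for a long DNA:DNA hybrid (empirical formula,
    assumed here): 81.5 + 16.6*log10[Na+] + 0.41*(%GC) - 0.63*(%formamide) - 600/L."""
    return (81.5
            + 16.6 * math.log10(na_molar)
            + 0.41 * gc_percent
            - 0.63 * formamide_percent
            - 600.0 / length_bp)

def wash_temperature_range(tm: float) -> tuple:
    """Wash temperatures about 20 to 5 degrees C below Tm, as (low, high)."""
    return (tm - 20.0, tm - 5.0)

# Illustrative inputs only: 1.0 M Na+, 50% GC, 600 bp duplex, no formamide.
tm = melting_temperature(na_molar=1.0, gc_percent=50.0, length_bp=600)
low, high = wash_temperature_range(tm)
print(round(tm, 1), round(low, 1), round(high, 1))  # 101.0 81.0 96.0
```

Lowering the salt concentration or adding formamide lowers the computed Tm, which is consistent with the use of low-SSC washes and formamide to raise stringency.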

High stringency conditions for hybridization between polynucleotides of the present invention
include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS, for 1 hour.
Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC concentration may
be varied from about 0.1 to 2 x SSC, with SDS being present at about 0.1%. Typically, blocking
reagents are used to block non-specific hybridization. Such blocking reagents include, for instance,
sheared and denatured salmon sperm DNA at about 100-200 μg/ml. Organic solvent, such as
formamide at a concentration of about 35-50% v/v, may also be used under particular circumstances,
such as for RNA:DNA hybridizations. Useful variations on these wash conditions will be readily
apparent to those of ordinary skill in the art. Hybridization, particularly under high stringency 
conditions, may be suggestive of evolutionary similarity between the nucleotides. Such similarity is 
strongly indicative of a similar role for the nucleotides and their encoded polypeptides. 

The term "hybridization complex" refers to a complex formed between two nucleic acid
sequences by virtue of the formation of hydrogen bonds between complementary bases. A hybridization
complex may be formed in solution (e.g., C0t or R0t analysis) or formed between one nucleic acid
sequence present in solution and another nucleic acid sequence immobilized on a solid support (e.g., 
paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate to which cells or
their nucleic acids have been fixed).

The words "insertion" and "addition" refer to changes in an amino acid or nucleotide sequence 
resulting in the addition of one or more amino acid residues or nucleotides, respectively. 

"Immune response" can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression of
various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect cellular and
systemic defense systems. 

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of PPIM which is 
capable of eliciting an immune response when introduced into a living organism, for example, a mammal. 
The term "immunogenic fragment" also includes any polypeptide or oligopeptide fragment of PPIM
which is useful in any of the antibody production methods disclosed herein or known in the art.

The term "microarray" refers to an arrangement of a plurality of polynucleotides, polypeptides, 
or other chemical compounds on a substrate. 

The terms "element" and "array element" refer to a polynucleotide, polypeptide, or other 
chemical compound having a unique and defined position on a microarray. 

The term "modulate" refers to a change in the activity of PPIM. For example, modulation may
cause an increase or a decrease in protein activity, binding characteristics, or any other biological,
functional, or immunological properties of PPIM.

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, 
polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or 
synthetic origin which may be single-stranded or double-stranded and may represent the sense or the
antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material. 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding
sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where
necessary to join two protein coding regions, in the same reading frame. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which comprises 
an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of amino acid 
residues ending in lysine. The terminal lysine confers solubility to the composition. PNAs preferentially
bind complementary single stranded DNA or RNA and stop transcript elongation, and may be pegylated
to extend their lifespan in the cell.

"Post-translational modification" of a PPIM may involve lipidation, glycosylation,
phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in the 
art. These processes may occur synthetically or biochemically. Biochemical modifications will vary by
cell type depending on the enzymatic milieu of PPIM.

"Probe" refers to nucleic acid sequences encoding PPIM, their complements, or fragments 
thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are isolated 
oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. Typical labels 
include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. "Primers" are short
nucleic acids, usually DNA oligonucleotides, which may be annealed to a target polynucleotide by
complementary base-pairing. The primer may then be extended along the target DNA strand by a DNA
polymerase enzyme. Primer pairs can be used for amplification (and identification) of a nucleic acid 
sequence, e.g., by the polymerase chain reaction (PCR). 

Probes and primers as used in the present invention typically comprise at least 15 contiguous
nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also be
employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, or 
at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers may be 
considerably longer than these examples, and it is understood that any length supported by the 
specification, including the tables, figures, and Sequence Listing, may be used. 
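
As a simple illustration of "contiguous nucleotides of a known sequence," the Python sketch below enumerates every window of a chosen length from an input sequence. The function name and the example sequence are hypothetical. Real probe or primer selection would apply further criteria, such as specificity and melting temperature, that are not modeled here.

```python
from typing import Iterator

def candidate_probes(sequence: str, length: int = 15) -> Iterator[str]:
    """Yield every contiguous window of `length` nucleotides, 5' to 3'."""
    for start in range(len(sequence) - length + 1):
        yield sequence[start:start + length]

known_sequence = "ACGTACGTACGTACGTA"  # 17 nucleotides, invented for illustration
probes = list(candidate_probes(known_sequence, length=15))
print(len(probes))  # 3
print(probes[0])    # ACGTACGTACGTACG
```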

Methods for preparing and using probes and primers are described in the references, for example
Sambrook, J. et al. (1989) Molecular Cloning: A Laboratory Manual, 2nd ed., vol. 1-3, Cold Spring
Harbor Press, Plainview NY; Ausubel, F.M. et al. (1987) Current Protocols in Molecular Biology,
Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis, M. et al. (1990) PCR Protocols: A
Guide to Methods and Applications, Academic Press, San Diego CA. PCR primer pairs can be derived
from a known sequence, for example, by using computer programs intended for that purpose such as
Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge MA).

Oligonucleotides for use as primers are selected using software known in the art for such 
purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 5,000
nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer selection
programs have incorporated additional features for expanded capabilities. For example, the PrimOU 
primer selection program (available to the public from the Genome Center at University of Texas South 
West Medical Center, Dallas TX) is capable of choosing specific primers from megabase sequences and 
is thus useful for designing primers on a genome-wide scope. The Primer3 primer selection program
(available to the public from the Whitehead Institute/MIT Center for Genome Research, Cambridge MA)
allows the user to input a "mispriming library," in which sequences to avoid as primer binding sites are
user-specified. Primer3 is useful, in particular, for the selection of oligonucleotides for microarrays. 
(The source code for the latter two primer selection programs may also be obtained from their respective 
sources and modified to meet the user's specific needs.) The PrimeGen program (available to the public
from the UK Human Genome Mapping Project Resource Centre, Cambridge UK) designs primers based
on multiple sequence alignments, thereby allowing selection of primers that hybridize to either the most 
conserved or least conserved regions of aligned nucleic acid sequences. Hence, this program is useful for 
identification of both unique and conserved oligonucleotides and polynucleotide fragments. The 
oligonucleotides and polynucleotide fragments identified by any of the above selection methods are useful
in hybridization technologies, for example, as PCR or sequencing primers, microarray elements, or
specific probes to identify fully or partially complementary polynucleotides in a sample of nucleic acids. 
Methods of oligonucleotide selection are not limited to those described above. 

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence that 
is made by an artificial combination of two or more otherwise separated segments of sequence. This
artificial combination is often accomplished by chemical synthesis or, more commonly, by the artificial
manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques such as those 
described in Sambrook, supra . The term recombinant includes nucleic acids that have been altered solely 
by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, a recombinant nucleic 
acid may include a nucleic acid sequence operably linked to a promoter sequence. Such a recombinant
nucleic acid may be part of a vector that is used, for example, to transform a cell.

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a
vaccinia virus, that could be used to vaccinate a mammal wherein the recombinant nucleic acid is
expressed, inducing a protective immunological response in the mammal.

A "regulatory element" refers to a nucleic acid sequence usually derived from untranslated 
regions of a gene and includes enhancers, promoters, introns, and 5' and 3' untranslated regions (UTRs).
Regulatory elements interact with host or viral proteins which control transcription, translation, or RNA 
stability. 

"Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid, 
amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent,
chemiluminescent, or chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and other
moieties known in the art.

An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear sequence 
of nucleotides as the reference DNA sequence with the exception that all occurrences of the nitrogenous 
base thymine are replaced with uracil, and the sugar backbone is composed of ribose instead of
deoxyribose.
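
At the level of the written sequence, the RNA equivalent defined above amounts to a single substitution. The following is a minimal sketch, assuming the DNA sequence is given as a plain string of A, C, G, and T; the function name `rna_equivalent` is hypothetical, and the ribose backbone has no counterpart in a text representation.

```python
def rna_equivalent(dna: str) -> str:
    """Return the RNA equivalent of a DNA string.

    Every thymine (T) is replaced with uracil (U); all other
    characters, and the letter case, are left unchanged.
    """
    return dna.translate(str.maketrans("Tt", "Uu"))


if __name__ == "__main__":
    # Illustrative sequence only; it is not taken from the Sequence Listing.
    print(rna_equivalent("ATGGCTTAA"))
```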

The term "sample" is used in its broadest sense. A sample suspected of containing nucleic acids 
encoding PPIM, or fragments thereof, or PPIM itself, may comprise a bodily fluid; an extract from a 
cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, RNA, or cDNA, in 
solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" and "specifically binding" refer to that interaction between a
protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or
synthetic binding composition. The interaction is dependent upon the presence of a particular structure 
of the protein, e.g., the antigenic determinant or epitope, recognized by the binding molecule. For 
example, if an antibody is specific for epitope "A," the presence of a polypeptide comprising the epitope
A, or the presence of free unlabeled A, in a reaction containing free labeled A and the antibody will
reduce the amount of labeled A that binds to the antibody. 

The term "substantially purified" refers to nucleic acid or amino acid sequences that are removed
from their natural environment and are isolated or separated, and are at least 60% free, preferably at
least 75% free, and most preferably at least 90% free from other components with which they are
naturally associated.

A "substitution" refers to the replacement of one or more amino acid residues or nucleotides by 
different amino acid residues or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, chips, 
slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, microparticles and
capillaries. The substrate can have a variety of surface forms, such as wells, trenches, pins, channels
and pores, to which polynucleotides or polypeptides are bound.

A "transcript image" refers to the collective pattern of gene expression by a particular cell type 
or tissue under given conditions at a given time. 

"Transformation" describes a process by which exogenous DNA is introduced into a recipient 
cell. Transformation may occur under natural or artificial conditions according to various methods well
known in the art, and may rely on any known method for the insertion of foreign nucleic acid sequences
into a prokaryotic or eukaryotic host cell. The method for transformation is selected based on the type of
host cell being transformed and may include, but is not limited to, bacteriophage or viral infection, 
electroporation, heat shock, lipofection, and particle bombardment. The term "transformed" cells
includes stably transformed cells in which the inserted DNA is capable of replication either as an
autonomously replicating plasmid or as part of the host chromosome, as well as transiently transformed
cells which express the inserted DNA or RNA for limited periods of time. 

A "transgenic organism," as used herein, is any organism, including but not limited to animals 
and plants, in which one or more of the cells of the organism contains heterologous nucleic acid
introduced by way of human intervention, such as by transgenic techniques well known in the art. The
nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor of the cell, 
by way of deliberate genetic manipulation, such as by microinjection or by infection with a 
recombinant virus. The term genetic manipulation does not include classical cross-breeding, or in vitro 
fertilization, but rather is directed to the introduction of a recombinant DNA molecule. The transgenic
organisms contemplated in accordance with the present invention include bacteria, cyanobacteria,
fungi, plants, and animals. The isolated DNA of the present invention can be introduced into the host
by methods known in the art, for example infection, transfection, transformation or transconjugation.
Techniques for transferring the DNA of the present invention into such organisms are widely known 
and provided in references such as Sambrook, J. et al. (1989), supra . 

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having at
least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of the
nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999) 
set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at least 
60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 95% or at least 98% or greater
sequence identity over a certain defined length. A variant may be described as, for example, an "allelic"
(as defined above), "splice," "species," or "polymorphic" variant. A splice variant may have significant 
identity to a reference molecule, but will generally have a greater or lesser number of polynucleotides due 
to alternative splicing of exons during mRNA processing. The corresponding polypeptide may possess 
additional functional domains or lack domains that are present in the reference molecule. Species
variants are polynucleotide sequences that vary from one species to another. The resulting polypeptides
generally will have significant amino acid identity relative to each other. A polymorphic variant is a
variation in the polynucleotide sequence of a particular gene between individuals of a given species. 
Polymorphic variants also may encompass "single nucleotide polymorphisms" (SNPs) in which the 
polynucleotide sequence varies by one nucleotide base. The presence of SNPs may be indicative of, for
example, a certain population, a disease state, or a propensity for a disease state.

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having at 
least 40% sequence identity to the particular polypeptide sequence over a certain length of one of the 
polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999) 
set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at least
60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 98% or greater sequence identity
over a certain defined length of one of the polypeptides. 
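
The percent identity thresholds in the two "variant" definitions can be illustrated with a short calculation. The sketch below is not the "BLAST 2 Sequences" tool and does not perform any alignment; it assumes the two sequences have already been aligned (for example, by BLAST) into equal-length strings with "-" marking gaps, and it counts identical, non-gap columns over the alignment length. The function names and the example strings are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns holding the same residue in both rows.

    Assumes the inputs are two rows of one pairwise alignment:
    equal length, with '-' used for gap positions.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


def is_variant(aligned_a: str, aligned_b: str, threshold: float = 40.0) -> bool:
    """True if the aligned pair meets the minimum identity threshold.

    The default of 40% mirrors the lowest figure in the definitions above.
    """
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Made-up alignment rows, used only to exercise the functions.
    row_a, row_b = "ACGT-A", "ACCTTA"
    print(f"{percent_identity(row_a, row_b):.1f}% identity")
    print("variant at 40% threshold:", is_variant(row_a, row_b))
```

Passing a stricter `threshold` (for example 90.0) applies the higher identity levels recited in the same definitions.
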
THE INVENTION 

The invention is based on the discovery of new human proteases and protease inhibitors (PPIM), 
the polynucleotides encoding PPIM, and the use of these compositions for the diagnosis, treatment, or
prevention of cell proliferative and autoimmune/inflammatory disorders.

Table 1 lists the Incyte clones used to assemble full length nucleotide sequences encoding PPIM. 
Columns 1 and 2 show the sequence identification numbers (SEQ ID NOs) of the polypeptide and 
nucleotide sequences, respectively. Column 3 shows the clone IDs of the Incyte clones in which nucleic 
acids encoding each PPIM were identified, and column 4 shows the cDNA libraries from which these
clones were isolated. Column 5 shows Incyte clones and their corresponding cDNA libraries. Clones for
which cDNA libraries are not indicated were derived from pooled cDNA libraries. In some cases, 
GenBank sequence identifiers are also shown in column 5. The Incyte clones and GenBank cDNA 
sequences, where indicated, in column 5 were used to assemble the consensus nucleotide sequence of 
each PPIM and are useful as fragments in hybridization technologies. 

The columns of Table 2 show various properties of each of the polypeptides of the invention:
column 1 references the SEQ ID NO; column 2 shows the number of amino acid residues in each
polypeptide; column 3 shows potential phosphorylation sites; column 4 shows potential glycosylation 
sites; column 5 shows the amino acid residues comprising signature sequences and motifs; column 6 
shows homologous sequences as identified by BLAST analysis; and column 7 shows analytical methods
and, in some cases, searchable databases to which the analytical methods were applied. The methods of
column 7 were used to characterize each polypeptide through sequence homology and protein motifs. 

The columns of Table 3 show the tissue-specificity and diseases, disorders, or conditions 
associated with nucleotide sequences encoding PPIM. The first column of Table 3 lists the nucleotide 
SEQ ID NOs. Column 2 lists fragments of the nucleotide sequences of column 1. These fragments are
useful, for example, in hybridization or amplification technologies to identify SEQ ID NO:28-54 and
to distinguish between SEQ ID NO:28-54 and related polynucleotide sequences. The polypeptides
encoded by the selected fragments of SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31,
SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:39, 
SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, 
SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51,
SEQ ID NO:52, SEQ ID NO:53, and SEQ ID NO:54 are useful, for example, as immunogenic peptides. 
Column 3 lists tissue categories which express PPIM as a fraction of total tissues expressing PPIM. 
Column 4 lists diseases, disorders, or conditions associated with those tissues expressing PPIM as a 
fraction of total tissues expressing PPIM. Column 5 lists the vectors used to subclone each cDNA
library. Of particular note is the expression of SEQ ID NO:28 in gastrointestinal tissue. Of particular
note is the tissue-specific expression of SEQ ID NO:51. Over 83% of the tissues expressing SEQ ID
NO:51 are derived from gastrointestinal tissue, particularly the liver. 

The columns of Table 4 show descriptions of the tissues used to construct the cDNA libraries
from which cDNA clones encoding PPIM were isolated. Column 1 references the nucleotide SEQ ID
NOs, column 2 shows the cDNA libraries from which these clones were isolated, and column 3 shows
the tissue origins and other descriptive information relevant to the cDNA libraries in column 2. 

SEQ ID NO:30 maps to chromosome 9 within the interval from 78.4 to 90.6 centiMorgans. 
This interval also contains a gene associated with cell proliferation. 

SEQ ID NO:37 maps to chromosome 12 within the interval from 116.6 to 118.9 centiMorgans.
This interval also contains a gene associated with a neurological disorder.

SEQ ID NO:47 maps to chromosome 4 within the interval from 99.2 to 105.2 centiMorgans. 
This interval also contains a gene associated with cardiovascular disease. 

The invention also encompasses PPIM variants. A preferred PPIM variant is one which has at 
least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid sequence 
identity to the PPIM amino acid sequence, and which contains at least one functional or structural
characteristic of PPIM. 

The invention also encompasses polynucleotides which encode PPIM. In a particular 
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected from 
the group consisting of SEQ ID NO:28-54, which encodes PPIM. The polynucleotide sequences of SEQ
ID NO:28-54, as presented in the Sequence Listing, embrace the equivalent RNA sequences, wherein
occurrences of the nitrogenous base thymine are replaced with uracil, and the sugar backbone is 
composed of ribose instead of deoxyribose. 

The invention also encompasses a variant of a polynucleotide sequence encoding PPIM. In 
particular, such a variant polynucleotide sequence will have at least about 70%, or alternatively at least 
about 85%, or even at least about 95% polynucleotide sequence identity to the polynucleotide sequence
encoding PPIM. A particular aspect of the invention encompasses a variant of a polynucleotide sequence
comprising a sequence selected from the group consisting of SEQ ID NO:28-54 which has at least about 
70%, or alternatively at least about 85%, or even at least about 95% polynucleotide sequence identity to 
a nucleic acid sequence selected from the group consisting of SEQ ID NO:28-54. Any one of the
polynucleotide variants described above can encode an amino acid sequence which contains at least one
functional or structural characteristic of PPIM. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the genetic 
code, a multitude of polynucleotide sequences encoding PPIM, some bearing minimal similarity to the 
polynucleotide sequences of any known and naturally occurring gene, may be produced. Thus, the
invention contemplates each and every possible variation of polynucleotide sequence that could be made
by selecting combinations based on possible codon choices. These combinations are made in accordance 
with the standard triplet genetic code as applied to the polynucleotide sequence of naturally occurring 
PPIM, and all such variations are to be considered as being specifically disclosed. 

Although nucleotide sequences which encode PPIM and its variants are generally capable of
hybridizing to the nucleotide sequence of the naturally occurring PPIM under appropriately selected
conditions of stringency, it may be advantageous to produce nucleotide sequences encoding PPIM or its 
derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturally occurring 
codons. Codons may be selected to increase the rate at which expression of the peptide occurs in a 
particular prokaryotic or eukaryotic host in accordance with the frequency with which particular codons
are utilized by the host. Other reasons for substantially altering the nucleotide sequence encoding PPIM
and its derivatives without altering the encoded amino acid sequences include the production of RNA 
transcripts having more desirable properties, such as a greater half-life, than transcripts produced from 
the naturally occurring sequence. 
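
The scale of the codon degeneracy discussed above can be made concrete with a small calculation. The following is an illustrative sketch only: it builds the standard triplet genetic code, counts how many distinct coding sequences specify a given peptide, and enumerates them for short peptides. The function names and the example peptides are hypothetical, and host-specific codon frequencies are deliberately left out because the text does not supply them.

```python
from itertools import product
from math import prod

# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)

# Map each amino acid (or '*' for stop) to its list of synonymous codons.
SYNONYMOUS_CODONS: dict[str, list[str]] = {}
for bases, aa in zip(product(BASES, repeat=3), AMINO_ACIDS):
    SYNONYMOUS_CODONS.setdefault(aa, []).append("".join(bases))


def count_encodings(peptide: str) -> int:
    """Number of distinct DNA coding sequences for the peptide."""
    return prod(len(SYNONYMOUS_CODONS[aa]) for aa in peptide.upper())


def enumerate_encodings(peptide: str):
    """Yield every DNA coding sequence for the peptide (short peptides only)."""
    codon_choices = (SYNONYMOUS_CODONS[aa] for aa in peptide.upper())
    for combination in product(*codon_choices):
        yield "".join(combination)


if __name__ == "__main__":
    # Made-up peptides, used only to show how quickly the count grows.
    print(count_encodings("MK"), list(enumerate_encodings("MK")))
    print(count_encodings("LSR"))
```

Because the count is a product over residues, it grows rapidly with peptide length, which is why only `count_encodings` is practical for full-length proteins.
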

The invention also encompasses production of DNA sequences which encode PPIM and PPIM
derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the synthetic
sequence may be inserted into any of the many available expression vectors and cell systems using 
reagents well known in the art. Moreover, synthetic chemistry may be used to introduce mutations into a 
sequence encoding PPIM or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of hybridizing
to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID NO:28-54 and
fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. and S.L. Berger (1987)
Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 152:507-511.) Hybridization
conditions, including annealing and wash conditions, are described in "Definitions." 

Methods for DNA sequencing are well known in the art and may be used to practice any of the
embodiments of the invention. The methods may employ such enzymes as the Klenow fragment of DNA
polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (PE Biosystems, Foster
City CA), thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway NJ), or combinations
of polymerases and proofreading exonucleases such as those found in the ELONGASE amplification
system (Life Technologies, Gaithersburg MD). Preferably, sequence preparation is automated with
machines such as the MICROLAB 2200 liquid transfer system (Hamilton, Reno NV), PTC200 thermal
cycler (MJ Research, Watertown MA) and ABI CATALYST 800 thermal cycler (PE Biosystems). 
Sequencing is then carried out using either the ABI 373 or 377 DNA sequencing system (PE 
Biosystems), the MEGABACE 1000 DNA sequencing system (Molecular Dynamics, Sunnyvale CA), or 
other systems known in the art. The resulting sequences are analyzed using a variety of algorithms
which are well known in the art. (See, e.g., Ausubel, F.M. (1997) Short Protocols in Molecular Biology,
John Wiley & Sons, New York NY, unit 7.7; Meyers, R.A. (1995) Molecular Biology and 
Biotechnology , Wiley VCH, New York NY, pp. 856-853.) 

The nucleic acid sequences encoding PPIM may be extended utilizing a partial nucleotide 
sequence and employing various PCR-based methods known in the art to detect upstream sequences,
such as promoters and regulatory elements. For example, one method which may be employed,
restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic 
DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 2:318-322.) Another 
method, inverse PCR, uses primers that extend in divergent directions to amplify unknown sequence 
from a circularized template. The template is derived from restriction fragments comprising a known
genomic locus and surrounding sequences. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids Res.
16:8186.) A third method, capture PCR, involves PCR amplification of DNA fragments adjacent to 
known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom, M. et al. 
(1991) PCR Methods Applic. 1:111-119.) In this method, multiple restriction enzyme digestions and 
ligations may be used to insert an engineered double-stranded sequence into a region of unknown
sequence before performing PCR. Other methods which may be used to retrieve unknown sequences are
known in the art. (See, e.g., Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-3060.) Additionally,
one may use PCR, nested primers, and PROMOTERFINDER libraries (Clontech, Palo Alto CA) to 
walk genomic DNA. This procedure avoids the need to screen libraries and is useful in finding 
intron/exon junctions. For all PCR-based methods, primers may be designed using commercially
available software, such as OLIGO 4.06 Primer Analysis software (National Biosciences, Plymouth
MN) or another appropriate program, to be about 22 to 30 nucleotides in length, to have a GC content of
about 50% or more, and to anneal to the template at temperatures of about 68°C to 72°C.
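
The length and GC-content criteria stated above lend themselves to a simple screening step. The sketch below is a hypothetical illustration and is not the OLIGO software: it treats "about 22 to 30 nucleotides" and "about 50% or more" as hard limits, and it omits the annealing-temperature criterion because the text does not specify a melting-temperature model. The function names and the test sequences are assumptions made for this example.

```python
def gc_fraction(primer: str) -> float:
    """Fraction of bases in the primer that are G or C."""
    sequence = primer.upper()
    if not sequence:
        raise ValueError("primer is empty")
    return (sequence.count("G") + sequence.count("C")) / len(sequence)


def passes_basic_criteria(
    primer: str,
    min_length: int = 22,
    max_length: int = 30,
    min_gc: float = 0.50,
) -> bool:
    """Check a candidate primer against the length and GC limits.

    Annealing temperature is not evaluated here (see the note above).
    """
    if not primer:
        return False
    length_ok = min_length <= len(primer) <= max_length
    return length_ok and gc_fraction(primer) >= min_gc


if __name__ == "__main__":
    # Artificial candidates, used only to exercise the checks.
    for candidate in ("GC" * 12, "AT" * 12, "GC" * 5):
        print(len(candidate), passes_basic_criteria(candidate))
```
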

When screening for full-length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) library
does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence into 5'
non-transcribed regulatory regions.

Capillary electrophoresis systems which are commercially available may be used to analyze the
size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary
sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide-specific,
laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the
emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, PE Biosystems), and the entire process 
from loading of samples to computer analysis and electronic data display may be computer controlled.
Capillary electrophoresis is especially preferable for sequencing small DNA fragments which may be
present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments thereof which 
encode PPIM may be cloned in recombinant DNA molecules that direct expression of PPIM, or 
fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy of
the genetic code, other DNA sequences which encode substantially the same or a functionally equivalent
amino acid sequence may be produced and used to express PPIM. 

The nucleotide sequences of the present invention can be engineered using methods generally 
known in the art in order to alter PPIM-encoding sequences for a variety of purposes including, but not 
limited to, modification of the cloning, processing, and/or expression of the gene product. DNA
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic
oligonucleotides may be used to engineer the nucleotide sequences. For example,
oligonucleotide-mediated site-directed mutagenesis may be used to introduce mutations that create new restriction sites,
alter glycosylation patterns, change codon preference, produce splice variants, and so forth. 

The nucleotides of the present invention may be subjected to DNA shuffling techniques such
as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number
5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat. 
Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or improve 
the biological properties of PPIM, such as its biological or enzymatic activity or its ability to bind to 
other molecules or compounds. DNA shuffling is a process by which a library of gene variants is
produced using PCR-mediated recombination of gene fragments. The library is then subjected to
selection or screening procedures that identify those gene variants with the desired properties. These 
preferred variants may then be pooled and further subjected to recursive rounds of DNA shuffling and 
selection/screening. Thus, genetic diversity is created through "artificial" breeding and rapid molecular 
evolution. For example, fragments of a single gene containing random point mutations may be
recombined, screened, and then reshuffled until the desired properties are optimized. Alternatively,
fragments of a given gene may be recombined with fragments of homologous genes in the same gene
family, either from the same or different species, thereby maximizing the genetic diversity of multiple 
naturally occurring genes in a directed and controllable manner. 

In another embodiment, sequences encoding PPIM may be synthesized, in whole or in part, using 
chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. (1980) Nucleic Acids Symp.
Ser. 7:215-223; Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.) Alternatively, PPIM itself 
or a fragment thereof may be synthesized using chemical methods. For example, peptide synthesis can 
be performed using various solution-phase or solid-phase techniques. (See, e.g., Creighton, T. (1984) 
Proteins: Structures and Molecular Properties, WH Freeman, New York NY, pp. 55-60; and Roberge,
J.Y. et al. (1995) Science 269:202-204.) Automated synthesis may be achieved using the ABI 431A
peptide synthesizer (PE Biosystems). Additionally, the amino acid sequence of PPIM, or any part 
thereof, may be altered during direct synthesis and/or combined with sequences from other proteins, or 
any part thereof, to produce a variant polypeptide or a polypeptide having a sequence of a naturally 
occurring polypeptide. 

The peptide may be substantially purified by preparative high performance liquid
chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421.)
The composition of the synthetic peptides may be confirmed by amino acid analysis or by sequencing. 
(See, e.g., Creighton, supra , pp. 28-53.) 

In order to express a biologically active PPIM, the nucleotide sequences encoding PPIM or
derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which contains
the necessary elements for transcriptional and translational control of the inserted coding sequence in a 
suitable host. These elements include regulatory sequences, such as enhancers, constitutive and inducible 
promoters, and 5' and 3' untranslated regions in the vector and in polynucleotide sequences encoding 
PPIM. Such elements may vary in their strength and specificity. Specific initiation signals may also be
used to achieve more efficient translation of sequences encoding PPIM. Such signals include the ATG
initiation codon and adjacent sequences, e.g. the Kozak sequence. In cases where sequences encoding
PPIM and its initiation codon and upstream regulatory sequences are inserted into the appropriate
expression vector, no additional transcriptional or translational control signals may be needed. However,
in cases where only coding sequence, or a fragment thereof, is inserted, exogenous translational control
signals including an in-frame ATG initiation codon should be provided by the vector. Exogenous
translational elements and initiation codons may be of various origins, both natural and synthetic. The
efficiency of expression may be enhanced by the inclusion of enhancers appropriate for the particular
host cell system used. (See, e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ. 20:125-162.)

Methods which are well known to those skilled in the art may be used to construct expression
vectors containing sequences encoding PPIM and appropriate transcriptional and translational control
elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, and in vivo
genetic recombination. (See, e.g., Sambrook, J. et al. (1989) Molecular Cloning, A Laboratory Manual, 
Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausubel, F.M. et al. (1995) Current 
Protocols in Molecular Biology , John Wiley & Sons, New York NY, ch. 9, 13, and 16.) 

A variety of expression vector/host systems may be utilized to contain and express sequences
encoding PPIM. These include, but are not limited to, microorganisms such as bacteria transformed with
recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with yeast 
expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovirus); plant 
cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, or tobacco
mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or animal cell
systems. (See, e.g., Sambrook, supra ; Ausubel, supra ; Van Heeke, G. and S.M. Schuster (1989) J. 
Biol. Chem. 264:5503-5509; Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; Scorer, C.A. 
et al. (1994) Bio/Technology 12:181-184; Engelhard, E.K. et al. (1994) Proc. Natl. Acad. Sci. USA 
91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945; Takamatsu, N. (1987) EMBO
J. 6:307-311; Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. (1984) Science
224:838-843; Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105; The McGraw Hill 
Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp. 191-196; Logan, J. and 
T. Shenk (1984) Proc. Natl. Acad. Sci. USA 81:3655-3659; and Harrington, J.J. et al. (1997) Nat. 
Genet. 15:345-355.) Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia
viruses, or from various bacterial plasmids, may be used for delivery of nucleotide sequences to the
targeted organ, tissue, or cell population. (See, e.g., Di Nicola, M. et al. (1998) Cancer Gen. Ther. 
5(6):350-356; Yu, M. et al. (1993) Proc. Natl. Acad. Sci. USA 90(13):6340-6344; Buller, R.M. et al. 
(1985) Nature 317(6040):813-815; McGregor, D.P. et al. (1994) Mol. Immunol. 31(3):219-226; and 
Verma, I.M. and N. Somia (1997) Nature 389:239-242.) The invention is not limited by the host cell
employed.

In bacterial systems, a number of cloning and expression vectors may be selected depending 
upon the use intended for polynucleotide sequences encoding PPIM. For example, routine cloning, 
subcloning, and propagation of polynucleotide sequences encoding PPIM can be achieved using a 
multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1 plasmid
(Life Technologies). Ligation of sequences encoding PPIM into the vector's multiple cloning site
disrupts the lacZ gene, allowing a colorimetric screening procedure for identification of transformed
bacteria containing recombinant molecules. In addition, these vectors may be useful for in vitro
transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of nested
deletions in the cloned sequence. (See, e.g., Van Heeke, G. and S.M. Schuster (1989) J. Biol. Chem.
264:5503-5509.) When large quantities of PPIM are needed, e.g. for the production of antibodies,
vectors which direct high level expression of PPIM may be used. For example, vectors containing the
strong, inducible T5 or T7 bacteriophage promoter may be used. 

Yeast expression systems may be used for production of PPIM. A number of vectors containing 
constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH promoters, may be
used in the yeast Saccharomyces cerevisiae or Pichia pastoris. In addition, such vectors direct either the
secretion or intracellular retention of expressed proteins and enable integration of foreign sequences into 
the host genome for stable propagation. (See, e.g., Ausubel, 1995, supra ; Bitter, supra ; and Scorer, 
supra .) 

Plant systems may also be used for expression of PPIM. Transcription of sequences encoding
PPIM may be driven by viral promoters, e.g., the 35S and 19S promoters of CaMV used alone or in
combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 6:307-311).
Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock promoters may be 
used. (See, e.g., Coruzzi, supra ; Broglie, supra ; and Winter, supra .) These constructs can be introduced 
into plant cells by direct DNA transformation or pathogen-mediated transfection. (See, e.g., The
McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp. 191-196.)

In mammalian cells, a number of viral-based expression systems may be utilized. In cases where 
an adenovirus is used as an expression vector, sequences encoding PPIM may be ligated into an 
adenovirus transcription/translation complex consisting of the late promoter and tripartite leader 
sequence. Insertion in a non-essential E1 or E3 region of the viral genome may be used to obtain
infective virus which expresses PPIM in host cells. (See, e.g., Logan, J. and T. Shenk (1984) Proc. Natl.
Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma virus
(RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 or EBV-based
vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 
DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
constructed and delivered via conventional delivery methods (liposomes, polycationic amino polymers, or 
vesicles) for therapeutic purposes. (See, e.g., Harrington, J.J. et al. (1997) Nat. Genet. 15:345-355.) 

For long term production of recombinant proteins in mammalian systems, stable expression of 
PPIM in cell lines is preferred. For example, sequences encoding PPIM can be transformed into cell 
lines using expression vectors which may contain viral origins of replication and/or endogenous 
expression elements and a selectable marker gene on the same or on a separate vector. Following the 
introduction of the vector, cells may be allowed to grow for about 1 to 2 days in enriched media before 
being switched to selective media. The purpose of the selectable marker is to confer resistance to a 
selective agent, and its presence allows growth and recovery of cells which successfully express the 
introduced sequences. Resistant clones of stably transformed cells may be propagated using tissue 
culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These include, 
but are not limited to, the herpes simplex virus thymidine kinase and adenine phosphoribosyltransferase 
genes, for use in tk⁻ and apr⁻ cells, respectively. (See, e.g., Wigler, M. et al. (1977) Cell 11:223-232; 
Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, or herbicide resistance can be 
used as the basis for selection. For example, dhfr confers resistance to methotrexate; neo confers 
resistance to the aminoglycosides neomycin and G-418; and als and pat confer resistance to 
chlorsulfuron and phosphinothricin, respectively. (See, e.g., Wigler, M. et al. (1980) 
Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) J. Mol. Biol. 150:1-14.) 
Additional selectable genes have been described, e.g., trpB and hisD, which alter cellular requirements 
for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc. Natl. Acad. Sci. USA 
85:8047-8051.) Visible markers, e.g., anthocyanins, green fluorescent proteins (GFP; Clontech), 
β-glucuronidase and its substrate β-glucuronide, or luciferase and its substrate luciferin may be used. 
These markers can be used not only to identify transformants, but also to quantify the amount of 
transient or stable protein expression attributable to a specific vector system. (See, e.g., Rhodes, C.A. 
(1995) Methods Mol. Biol. 55:121-131.) 
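
The resistance markers named above can be summarized as a simple lookup. The following is only an illustrative sketch: the dictionary name and the helper function are hypothetical, and the entries cover only the dhfr, neo, als, and pat examples given in this paragraph (pat is listed against the herbicide phosphinothricin).

```python
# Marker gene -> selective agent(s), transcribed from the examples in the text.
# SELECTIVE_AGENTS and markers_for_agent are hypothetical names for illustration.
SELECTIVE_AGENTS = {
    "dhfr": ["methotrexate"],
    "neo": ["neomycin", "G-418"],
    "als": ["chlorsulfuron"],
    "pat": ["phosphinothricin"],
}


def markers_for_agent(agent):
    """Return the marker genes listed as conferring resistance to `agent`."""
    wanted = agent.lower()
    return sorted(
        marker
        for marker, agents in SELECTIVE_AGENTS.items()
        if wanted in (a.lower() for a in agents)
    )
```

For instance, `markers_for_agent("G-418")` looks up which of the listed markers would permit selection in G-418-containing media.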

Although the presence/absence of marker gene expression suggests that the gene of interest is 
also present, the presence and expression of the gene may need to be confirmed. For example, if the 
sequence encoding PPIM is inserted within a marker gene sequence, transformed cells containing 
sequences encoding PPIM can be identified by the absence of marker gene function. Alternatively, a 
marker gene can be placed in tandem with a sequence encoding PPIM under the control of a single 
promoter. Expression of the marker gene in response to induction or selection usually indicates 
expression of the tandem gene as well. 

In general, host cells that contain the nucleic acid sequence encoding PPIM and that express 
PPIM may be identified by a variety of procedures known to those of skill in the art. These procedures 
include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR amplification, and protein 
bioassay or immunoassay techniques which include membrane, solution, or chip based technologies for 
the detection and/or quantification of nucleic acid or protein sequences. 

Immunological methods for detecting and measuring the expression of PPIM using either 
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques include 
enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and fluorescence activated 
cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies 
reactive to two non-interfering epitopes on PPIM is preferred, but a competitive binding assay may be 
employed. These and other assays are well known in the art. (See, e.g., Hampton, R. et al. (1990) 
Serological Methods, a Laboratory Manual, APS Press, St. Paul MN, Sect. IV; Coligan, J.E. et al. 
(1997) Current Protocols in Immunology, Greene Pub. Associates and Wiley-Interscience, New York 
NY; and Pound, J.D. (1998) Immunochemical Protocols, Humana Press, Totowa NJ.) 

A wide variety of labels and conjugation techniques are known by those skilled in the art and 
may be used in various nucleic acid and amino acid assays. Means for producing labeled hybridization 
or PCR probes for detecting sequences related to polynucleotides encoding PPIM include oligolabeling, 
nick translation, end-labeling, or PCR amplification using a labeled nucleotide. Alternatively, the 
sequences encoding PPIM, or any fragments thereof, may be cloned into a vector for the production of an 
mRNA probe. Such vectors are known in the art, are commercially available, and may be used to 
synthesize RNA probes in vitro by addition of an appropriate RNA polymerase such as T7, T3, or SP6 
and labeled nucleotides. These procedures may be conducted using a variety of commercially available 
kits, such as those provided by Amersham Pharmacia Biotech, Promega (Madison WI), and US 
Biochemical. Suitable reporter molecules or labels which may be used for ease of detection include 
radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents, as well as substrates, 
cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding PPIM may be cultured under 
conditions suitable for the expression and recovery of the protein from cell culture. The protein 
produced by a transformed cell may be secreted or retained intracellularly depending on the sequence 
and/or the vector used. As will be understood by those of skill in the art, expression vectors containing 
polynucleotides which encode PPIM may be designed to contain signal sequences which direct secretion 
of PPIM through a prokaryotic or eukaryotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate expression of the inserted 
sequences or to process the expressed protein in the desired fashion. Such modifications of the 
polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, phosphorylation, 
lipidation, and acylation. Post-translational processing which cleaves a "prepro" or "pro" form of the 
protein may also be used to specify protein targeting, folding, and/or activity. Different host cells which 
have specific cellular machinery and characteristic mechanisms for post-translational activities (e.g., 
CHO, HeLa, MDCK, HEK293, and WI38) are available from the American Type Culture Collection 
(ATCC, Manassas VA) and may be chosen to ensure the correct modification and processing of the 
foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 
sequences encoding PPIM may be ligated to a heterologous sequence resulting in translation of a fusion 
protein in any of the aforementioned host systems. For example, a chimeric PPIM protein containing a 
heterologous moiety that can be recognized by a commercially available antibody may facilitate the 
screening of peptide libraries for inhibitors of PPIM activity. Heterologous protein and peptide moieties 
may also facilitate purification of fusion proteins using commercially available affinity matrices. Such 
moieties include, but are not limited to, glutathione S-transferase (GST), maltose binding protein (MBP), 
thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, c-myc, and hemagglutinin (HA). 
GST, MBP, Trx, CBP, and 6-His enable purification of their cognate fusion proteins on immobilized 
glutathione, maltose, phenylarsine oxide, calmodulin, and metal-chelate resins, respectively. FLAG, 
c-myc, and hemagglutinin (HA) enable immunoaffinity purification of fusion proteins using commercially 
available monoclonal and polyclonal antibodies that specifically recognize these epitope tags. A fusion 
protein may also be engineered to contain a proteolytic cleavage site located between the PPIM encoding 
sequence and the heterologous protein sequence, so that PPIM may be cleaved away from the 
heterologous moiety following purification. Methods for fusion protein expression and purification are 
discussed in Ausubel (1995, supra, ch. 10). A variety of commercially available kits may also be used to 
facilitate expression and purification of fusion proteins. 
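
The pairing of fusion moieties with purification matrices described above can be captured in a small lookup. This is a minimal sketch for illustration only: the names `AFFINITY_MATRIX`, `EPITOPE_TAGS`, and `purification_route` are hypothetical, and the entries simply restate the tag and matrix pairs listed in the text.

```python
# Fusion moiety -> immobilized ligand, as listed in the text.
# All identifiers here are hypothetical and used only for illustration.
AFFINITY_MATRIX = {
    "GST": "immobilized glutathione",
    "MBP": "immobilized maltose",
    "Trx": "immobilized phenylarsine oxide",
    "CBP": "immobilized calmodulin",
    "6-His": "metal-chelate resin",
}

# Epitope tags purified with tag-specific antibodies rather than a ligand resin.
EPITOPE_TAGS = {"FLAG", "c-myc", "HA"}


def purification_route(tag):
    """Describe how a fusion protein carrying `tag` would be purified."""
    if tag in AFFINITY_MATRIX:
        return "affinity purification on " + AFFINITY_MATRIX[tag]
    if tag in EPITOPE_TAGS:
        return "immunoaffinity purification with a tag-specific antibody"
    raise ValueError("tag not listed in the text: " + tag)
```

Keeping the ligand-binding moieties and the epitope tags in separate collections mirrors the distinction the text draws between affinity matrices and antibody-based immunoaffinity purification.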

In a further embodiment of the invention, synthesis of radiolabeled PPIM may be achieved in 
vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These systems 
couple transcription and translation of protein-coding sequences operably associated with the T7, T3, or 
SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid precursor, for 
example, 35S-methionine. 

PPIM of the present invention or fragments thereof may be used to screen for compounds that 
specifically bind to PPIM. At least one and up to a plurality of test compounds may be screened for 
specific binding to PPIM. Examples of test compounds include antibodies, oligonucleotides, proteins 
(e.g., receptors), or small molecules. 

In one embodiment, the compound thus identified is closely related to the natural ligand of 
PPIM, e.g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or a 
natural binding partner. (See, Coligan, J.E. et al. (1991) Current Protocols in Immunology 1(2): 
Chapter 5.) Similarly, the compound can be closely related to the natural receptor to which PPIM 
binds, or to at least a fragment of the receptor, e.g., the ligand binding site. In either case, the 
compound can be rationally designed using known techniques. In one embodiment, screening for 
these compounds involves producing appropriate cells which express PPIM, either as a secreted protein 
or on the cell membrane. Preferred cells include cells from mammals, yeast, Drosophila, or E. coli. 
Cells expressing PPIM or cell membrane fractions which contain PPIM are then contacted with a test 
compound and binding, stimulation, or inhibition of activity of either PPIM or the compound is 
analyzed. 

An assay may simply test binding of a test compound to the polypeptide, wherein binding is 
detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. For example, the 
assay may comprise the steps of combining at least one test compound with PPIM, either in solution or 
affixed to a solid support, and detecting the binding of PPIM to the compound. Alternatively, the 
assay may detect or measure binding of a test compound in the presence of a labeled competitor. 
Additionally, the assay may be carried out using cell free preparations, chemical libraries, or natural 
product mixtures, and the test compound(s) may be free in solution or affixed to a solid support. 

PPIM of the present invention or fragments thereof may be used to screen for compounds that 
modulate the activity of PPIM. Such compounds may include agonists, antagonists, or partial or 
inverse agonists. In one embodiment, an assay is performed under conditions permissive for PPIM 
activity, wherein PPIM is combined with at least one test compound, and the activity of PPIM in the 
presence of a test compound is compared with the activity of PPIM in the absence of the test 
compound. A change in the activity of PPIM in the presence of the test compound is indicative of a 
compound that modulates the activity of PPIM. Alternatively, a test compound is combined with an in 
vitro or cell-free system comprising PPIM under conditions suitable for PPIM activity, and the assay is 
performed. In either of these assays, a test compound which modulates the activity of PPIM may do so 
indirectly and need not come in direct contact with PPIM. At least one and up to a 
plurality of test compounds may be screened. 
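
The comparison at the heart of this assay, activity in the presence of a test compound versus activity in its absence, can be sketched as follows. This is an illustrative sketch only: the function name and the `min_fold_change` cutoff of 1.5 are assumptions introduced here, since the text does not specify how large a change must be to count.

```python
def classify_modulation(activity_with, activity_without, min_fold_change=1.5):
    """Compare PPIM activity with and without a test compound.

    `min_fold_change` is an assumed, illustrative cutoff; the text only states
    that a change in activity indicates a modulating compound.
    """
    if activity_without <= 0 or activity_with < 0:
        raise ValueError("activities must be non-negative, with a positive control value")
    ratio = activity_with / activity_without
    if ratio >= min_fold_change:
        return "increases activity"
    if ratio <= 1.0 / min_fold_change:
        return "decreases activity"
    return "no change detected"
```

In practice the cutoff would be set from the variability of replicate control measurements rather than fixed in advance.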

In another embodiment, polynucleotides encoding PPIM or their mammalian homologs may 
be "knocked out" in an animal model system using homologous recombination in embryonic stem (ES) 
cells. Such techniques are well known in the art and are useful for the generation of animal models of 
human disease. (See, e.g., U.S. Patent No. 5,175,383 and U.S. Patent No. 5,767,337.) For example, 
mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the early mouse embryo and 
grown in culture. The ES cells are transformed with a vector containing the gene of interest disrupted 
by a marker gene, e.g., the neomycin phosphotransferase gene (neo; Capecchi, M.R. (1989) Science 
244:1288-1292). The vector integrates into the corresponding region of the host genome by 
homologous recombination. Alternatively, homologous recombination takes place using the Cre-loxP 
system to knock out a gene of interest in a tissue- or developmental stage-specific manner (Marth, J.D. 
(1996) J. Clin. Invest. 97:1999-2002; Wagner, K.U. et al. (1997) Nucleic Acids Res. 25:4323-4330). 
Transformed ES cells are identified and microinjected into mouse cell blastocysts such as those from 
the C57BL/6 mouse strain. The blastocysts are surgically transferred to pseudopregnant dams, and the 
resulting chimeric progeny are genotyped and bred to produce heterozygous or homozygous strains. 
Transgenic animals thus generated may be tested with potential therapeutic or toxic agents. 

Polynucleotides encoding PPIM may also be manipulated in vitro in ES cells derived from 
human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell 
lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate 
into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et al. 
(1998) Science 282:1145-1147). 

Polynucleotides encoding PPIM can also be used to create "knockin" humanized animals 
(pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a region 
of a polynucleotide encoding PPIM is injected into animal ES cells, and the injected sequence 
integrates into the animal cell genome. Transformed cells are injected into blastulae, and the blastulae 
are implanted as described above. Transgenic progeny or inbred lines are studied and treated with 
potential pharmaceutical agents to obtain information on treatment of a human disease. Alternatively, 
a mammal inbred to overexpress PPIM, e.g., by secreting PPIM in its milk, may also serve as a 
convenient source of that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev. 4:55-74). 

THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists between 
regions of PPIM and proteases and protease inhibitors. In addition, the expression of PPIM is closely 
associated with cell proliferation, inflammation, the immune response, and gastrointestinal, 
neurological, and reproductive tissue. Therefore, PPIM appears to play a role in cell proliferative and 
autoimmune/inflammatory disorders. In the treatment of disorders associated with increased PPIM 
expression or activity, it is desirable to decrease the expression or activity of PPIM. In the treatment of 
disorders associated with decreased PPIM expression or activity, it is desirable to increase the 
expression or activity of PPIM. 

Therefore, in one embodiment, PPIM or a fragment or derivative thereof may be administered 
to a subject to treat or prevent a disorder associated with decreased expression or activity of PPIM. 
Examples of such disorders include, but are not limited to, a cell proliferative disorder, such as actinic 
keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue disease 
(MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, primary 
thrombocythemia, and cancers including adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, 
sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone, bone marrow, 
brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, 
ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and 
uterus; and an autoimmune/inflammatory disorder such as acquired immunodeficiency syndrome 
(AIDS), Addison's disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, 
amyloidosis, anemia, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, 
autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, 
cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, 
emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema 
nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, 
Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia 
gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, 
psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic 
anaphylaxis, systemic lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative 
colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and extracorporeal 
circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic infections, and trauma. 

In another embodiment, a vector capable of expressing PPIM or a fragment or derivative thereof 
may be administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of PPIM including, but not limited to, those described above. 

In a further embodiment, a composition comprising a substantially purified PPIM in conjunction 
with a suitable pharmaceutical carrier may be administered to a subject to treat or prevent a disorder 
associated with decreased expression or activity of PPIM including, but not limited to, those provided 
above. 

In still another embodiment, an agonist which modulates the activity of PPIM may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or activity 
of PPIM including, but not limited to, those listed above. 

In a further embodiment, an antagonist of PPIM may be administered to a subject to treat or 
prevent a disorder associated with increased expression or activity of PPIM. Examples of such 
disorders include, but are not limited to, those cell proliferative and autoimmune/inflammatory disorders 
described above. In one aspect, an antibody which specifically binds PPIM may be used directly as an 
antagonist or indirectly as a targeting or delivery mechanism for bringing a pharmaceutical agent to cells 
or tissues which express PPIM. 

In an additional embodiment, a vector expressing the complement of the polynucleotide encoding 
PPIM may be administered to a subject to treat or prevent a disorder associated with increased 
expression or activity of PPIM including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary 
sequences, or vectors of the invention may be administered in combination with other appropriate 
therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made by 
one of ordinary skill in the art, according to conventional pharmaceutical principles. The combination of 
therapeutic agents may act synergistically to effect the treatment or prevention of the various disorders 
described above. Using this approach, one may be able to achieve therapeutic efficacy with lower 
dosages of each agent, thus reducing the potential for adverse side effects. 

An antagonist of PPIM may be produced using methods which are generally known in the art. 
In particular, purified PPIM may be used to produce antibodies or to screen libraries of pharmaceutical 
agents to identify those which specifically bind PPIM. Antibodies to PPIM may also be generated using 
methods that are well known in the art. Such antibodies may include, but are not limited to, polyclonal, 
monoclonal, chimeric, and single chain antibodies, Fab fragments, and fragments produced by a Fab 
expression library. Neutralizing antibodies (i.e., those which inhibit dimer formation) are generally 
preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, and 
others may be immunized by injection with PPIM or with any fragment or oligopeptide thereof which has 
immunogenic properties. Depending on the host species, various adjuvants may be used to increase 
immunological response. Such adjuvants include, but are not limited to, Freund's, mineral gels such as 
aluminum hydroxide, and surface active substances such as lysolecithin, pluronic polyols, polyanions, 
peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants used in humans, BCG (bacilli 
Calmette-Guerin) and Corynebacterium parvum are especially preferable. 

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to PPIM 
have an amino acid sequence consisting of at least about 5 amino acids, and generally will consist of at 
least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or fragments are 
identical to a portion of the amino acid sequence of the natural protein. Short stretches of PPIM amino 
acids may be fused with those of another protein, such as KLH, and antibodies to the chimeric molecule 
may be produced. 
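
The peptide preferences stated above (at least about 5 residues, generally at least about 10, and identical to a portion of the natural protein) can be expressed as a simple check. This is a hypothetical sketch: the function name is introduced here, and the sequences used with it would be placeholders, not sequences from this disclosure.

```python
def check_immunogen_peptide(peptide, protein):
    """Check a candidate peptide against the stated preferences.

    `peptide` and `protein` are one-letter amino acid strings. The thresholds
    of 5 and 10 residues restate the "at least about" values in the text.
    """
    return {
        "meets_minimum_length": len(peptide) >= 5,
        "meets_typical_length": len(peptide) >= 10,
        # True when the peptide occurs verbatim within the protein sequence.
        "identical_to_native_portion": peptide in protein,
    }
```

A peptide that passes the length checks but is not found in the protein sequence would correspond to a modified or chimeric immunogen rather than a native fragment.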

Monoclonal antibodies to PPIM may be prepared using any technique which provides for the 
production of antibody molecules by continuous cell lines in culture. These include, but are not limited 
to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma technique. 
(See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. Immunol. Methods 
81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. USA 80:2026-2030; and Cole, S.P. et al. 
(1984) Mol. Cell Biol. 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate antigen 
specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984) Proc. Natl. Acad. 
Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and Takeda, S. et al. 
(1985) Nature 314:452-454.) Alternatively, techniques described for the production of single chain 
antibodies may be adapted, using methods known in the art, to produce PPIM-specific single chain 
antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be generated 
by chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g., Burton, D.R. (1991) 
Proc. Natl. Acad. Sci. USA 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte population or 
by screening immunoglobulin libraries or panels of highly specific binding reagents as disclosed in the 
literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA 86:3833-3837; Winter, G. et 
al. (1991) Nature 349:293-299.) 

Antibody fragments which contain specific binding sites for PPIM may also be generated. For 
example, such fragments include, but are not limited to, F(ab')2 fragments produced by pepsin digestion 
of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of the F(ab')2 
fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and easy 
identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D. et al. 
(1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the desired 
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities are well known in the art. Such 
immunoassays typically involve the measurement of complex formation between PPIM and its specific 
antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive to two 
non-interfering PPIM epitopes is generally used, but a competitive binding assay may also be employed 
(Pound, supra). 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay techniques 
may be used to assess the affinity of antibodies for PPIM. Affinity is expressed as an association 
constant, Ka, which is defined as the molar concentration of PPIM-antibody complex divided by the 
molar concentrations of free antigen and free antibody under equilibrium conditions. The Ka determined 
for a preparation of polyclonal antibodies, which are heterogeneous in their affinities for multiple PPIM 
epitopes, represents the average affinity, or avidity, of the antibodies for PPIM. The Ka determined for a 
preparation of monoclonal antibodies, which are monospecific for a particular PPIM epitope, represents 
a true measure of affinity. High-affinity antibody preparations with Ka ranging from about 10^9 to 10^12 
L/mole are preferred for use in immunoassays in which the PPIM-antibody complex must withstand 
rigorous manipulations. Low-affinity antibody preparations with Ka ranging from about 10^6 to 10^7 
L/mole are preferred for use in immunopurification and similar procedures which ultimately require 
dissociation of PPIM, preferably in active form, from the antibody (Catty, D. (1988) Antibodies, Volume 
I: A Practical Approach, IRL Press, Washington DC; Liddell, J.E. and A. Cryer (1991) A Practical 
Guide to Monoclonal Antibodies, John Wiley & Sons, New York NY). 
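
The definition of the association constant given above, complex concentration divided by the product of the free antigen and free antibody concentrations at equilibrium, can be worked through numerically. The sketch below is illustrative only: the function names are hypothetical, and the range checks simply restate the approximate Ka ranges from the text as hard boundaries, which is a simplification.

```python
def association_constant(complex_molar, free_antigen_molar, free_antibody_molar):
    """Ka in L/mole: [complex] / ([free antigen] * [free antibody])."""
    if free_antigen_molar <= 0 or free_antibody_molar <= 0:
        raise ValueError("free concentrations must be positive")
    return complex_molar / (free_antigen_molar * free_antibody_molar)


def suggested_use(ka):
    """Map a Ka value onto the approximate ranges stated in the text."""
    if 1e9 <= ka <= 1e12:
        return "immunoassay"
    if 1e6 <= ka <= 1e7:
        return "immunopurification"
    return "outside the stated ranges"
```

For example, an equilibrium mixture with 2 nM complex and 1 nM each of free antigen and free antibody gives a Ka of about 2 x 10^9 L/mole, which falls in the high-affinity range described for immunoassays.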

The titer and avidity of polyclonal antibody preparations may be further evaluated to determine 
the quality and suitability of such preparations for certain downstream applications. For example, a 
polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, preferably 5-10 mg 
specific antibody/ml, is generally employed in procedures requiring precipitation of PPIM-antibody 
complexes. Procedures for evaluating antibody specificity, titer, and avidity, and guidelines for antibody 
quality and usage in various applications, are generally available. (See, e.g., Catty, supra, and Coligan 
et al., supra.) 

In another embodiment of the invention, the polynucleotides encoding PPIM, or any fragment or 
complement thereof, may be used for therapeutic purposes. In one aspect, modifications of gene 
expression can be achieved by designing complementary sequences or antisense molecules (DNA, RNA, 
PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene encoding PPIM. Such 
technology is well known in the art, and antisense oligonucleotides or larger fragments can be designed 
from various locations along the coding or control regions of sequences encoding PPIM. (See, e.g., 
Agrawal, S., ed. (1996) Antisense Therapeutics, Humana Press Inc., Totowa NJ.) 

In therapeutic use, any gene delivery system suitable for introduction of the antisense 
sequences into appropriate target cells can be used. Antisense sequences can be delivered 
intracellularly in the form of an expression plasmid which, upon transcription, produces a sequence 
complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g., 
Slater, J.E. et al. (1998) J. Allergy Clin. Immunol. 102(3):469-475; and Scanlon, K.J. et al. (1995) 
9(13):1288-1296.) Antisense sequences can also be introduced intracellularly through the use of viral 
vectors, such as retrovirus and adeno-associated virus vectors. (See, e.g., Miller, A.D. (1990) Blood 
76:271; Ausubel, supra; Uckert, W. and W. Walther (1994) Pharmacol. Ther. 63(3):323-347.) Other 
gene delivery mechanisms include liposome-derived systems, artificial viral envelopes, and other 
systems known in the art. (See, e.g., Rossi, J.J. (1995) Br. Med. Bull. 51(1):217-225; Boado, R.J. et 
al. (1998) J. Pharm. Sci. 87(11):1308-1315; and Morris, M.C. et al. (1997) Nucleic Acids Res. 
25(14):2730-2736.) 

In another embodiment of the invention, polynucleotides encoding PPIM may be used for 
somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic deficiency 
(e.g., in the cases of severe combined immunodeficiency (SCID)-X1 disease characterized by X-linked 
inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), severe combined 
immunodeficiency syndrome associated with an inherited adenosine deaminase (ADA) deficiency 
(Blaese, R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) Science 270:470-475), 
cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal, R.G. et al. (1995) Hum. Gene Therapy 
6:643-666; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:667-703), thalassemias, familial 
hypercholesterolemia, and hemophilia resulting from Factor VIII or Factor IX deficiencies (Crystal, R.G. 
(1995) Science 270:404-410; Verma, I.M. and N. Somia (1997) Nature 389:239-242)), (ii) express a 
conditionally lethal gene product (e.g., in the case of cancers which result from unregulated cell 
proliferation), or (iii) express a protein which affords protection against intracellular parasites (e.g., 
against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D. (1988) Nature 
335:395-396; Poeschla, E. et al. (1996) Proc. Natl. Acad. Sci. USA 93:11395-11399), hepatitis B or C 
virus (HBV, HCV); fungal parasites, such as Candida albicans and Paracoccidioides brasiliensis; and 
protozoan parasites such as Plasmodium falciparum and Trypanosoma cruzi). In the case where a 
genetic deficiency in PPIM expression or regulation causes disease, the expression of PPIM from an 
appropriate population of transduced cells may alleviate the clinical manifestations caused by the genetic 
deficiency. 

In a further embodiment of the invention, diseases or disorders caused by deficiencies in PPIM 
are treated by constructing mammalian expression vectors encoding PPIM and introducing these vectors 
by mechanical means into PPIM-deficient cells. Mechanical transfer technologies for use with cells in 
vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) ballistic gold particle 
delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene transfer, and (v) the use of 
DNA transposons (Morgan, R.A. and W.F. Anderson (1993) Annu. Rev. Biochem. 62:191-217; Ivics, 
Z. (1997) Cell 91:501-510; Boulay, J-L. and H. Recipon (1998) Curr. Opin. Biotechnol. 9:445-450). 

Expression vectors that may be effective for the expression of PPIM include, but are not limited
to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX vectors (Invitrogen, Carlsbad CA),
PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA), and PTET-OFF, PTET-ON,
PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). PPIM may be expressed using (i) a
constitutively active promoter (e.g., from cytomegalovirus (CMV), Rous sarcoma virus (RSV), SV40
virus, thymidine kinase (TK), or β-actin genes), (ii) an inducible promoter (e.g., the
tetracycline-regulated promoter (Gossen, M. and H. Bujard (1992) Proc. Natl. Acad. Sci. USA 89:5547-5551;
Gossen, M. et al. (1995) Science 268:1766-1769; Rossi, F.M.V. and H.M. Blau (1998) Curr.
Opin. Biotechnol. 9:451-456), commercially available in the T-REX plasmid (Invitrogen)); the
ecdysone-inducible promoter (available in the plasmids PVGRXR and PIND; Invitrogen); the
FK506/rapamycin inducible promoter; or the RU486/mifepristone inducible promoter (Rossi, F.M.V.
and H.M. Blau, supra)), or (iii) a tissue-specific promoter or the native promoter of the endogenous gene
encoding PPIM from a normal individual.

Commercially available liposome transformation kits (e.g., the PERFECT LIPID 
TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 
polynucleotides to target cells in culture and require minimal effort to optimize experimental parameters. 
In the alternative, transformation is performed using the calcium phosphate method (Graham, F.L. and
A.J. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al. (1982) EMBO J.
1:841-845). The introduction of DNA to primary cells requires modification of these standardized
mammalian transfection protocols.

In another embodiment of the invention, diseases or disorders caused by genetic defects with 
respect to PPIM expression are treated by constructing a retrovirus vector consisting of (i) the
polynucleotide encoding PPIM under the control of an independent promoter or the retrovirus long
terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive
element (RRE) along with additional retrovirus cis-acting RNA sequences and coding sequences required
for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are commercially
available (Stratagene) and are based on published data (Riviere, I. et al. (1995) Proc. Natl. Acad. Sci.
USA 92:6733-6737), incorporated by reference herein. The vector is propagated in an appropriate
vector producing cell line (VPCL) that expresses an envelope gene with a tropism for receptors on the
target cells or a promiscuous envelope protein such as VSVg (Armentano, D. et al. (1987) J. Virol.
61:1647-1650; Bender, M.A. et al. (1987) J. Virol. 61:1639-1646; Adam, M.A. and A.D. Miller (1988)
J. Virol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, R. et al. (1998) J. Virol.
72:9873-9880). U.S. Patent Number 5,910,434 to Rigg ("Method for obtaining retrovirus packaging
cell lines producing high transducing efficiency retroviral supernatant") discloses a method for obtaining 
retrovirus packaging cell lines and is hereby incorporated by reference. Propagation of retrovirus 
vectors, transduction of a population of cells (e.g., CD4+ T-cells), and the return of transduced cells to a
patient are procedures well known to persons skilled in the art of gene therapy and have been well
documented (Ranga, U. et al. (1997) J. Virol. 71:7020-7029; Bauer, G. et al. (1997) Blood 89:2259-2267;
Bonyhadi, M.L. (1997) J. Virol. 71:4707-4716; Ranga, U. et al. (1998) Proc. Natl. Acad. Sci.
USA 95:1201-1206; Su, L. (1997) Blood 89:2283-2290).

In the alternative, an adenovirus-based gene therapy delivery system is used to deliver 
polynucleotides encoding PPIM to cells which have one or more genetic abnormalities with respect to the
expression of PPIM. The construction and packaging of adenovirus-based vectors are well known to
those with ordinary skill in the art. Replication defective adenovirus vectors have proven to be versatile
for importing genes encoding immunoregulatory proteins into intact islets in the pancreas (Csete, M.E. et
al. (1995) Transplantation 27:263-268). Potentially useful adenoviral vectors are described in U.S.
Patent Number 5,707,618 to Armentano ("Adenovirus vectors for gene therapy"), hereby incorporated
by reference. For adenoviral vectors, see also Antinozzi, P.A. et al. (1999) Annu. Rev. Nutr.
19:511-544; and Verma, I.M. and N. Somia (1997) Nature 389:239-242, both incorporated by
reference herein.

In another alternative, a herpes-based, gene therapy delivery system is used to deliver 
polynucleotides encoding PPIM to target cells which have one or more genetic abnormalities with respect
to the expression of PPIM. The use of herpes simplex virus (HSV)-based vectors may be especially
valuable for introducing PPIM to cells of the central nervous system, for which HSV has a tropism. The
construction and packaging of herpes-based vectors are well known to those with ordinary skill in the art.
A replication-competent herpes simplex virus (HSV) type 1-based vector has been used to deliver a
reporter gene to the eyes of primates (Liu, X. et al. (1999) Exp. Eye Res. 169:385-395). The
construction of a HSV-1 virus vector has also been disclosed in detail in U.S. Patent Number 5,804,413
to DeLuca ("Herpes simplex virus strains for gene transfer"), which is hereby incorporated by reference.
U.S. Patent Number 5,804,413 teaches the use of recombinant HSV d92 which consists of a genome
containing at least one exogenous gene to be transferred to a cell under the control of the appropriate
promoter for purposes including human gene therapy. Also taught by this patent are the construction
and use of recombinant HSV strains deleted for ICP4, ICP27 and ICP22. For HSV vectors, see also
Goins, W.F. et al. (1999) J. Virol. 73:519-532 and Xu, H. et al. (1994) Dev. Biol. 163:152-161, hereby
incorporated by reference. The manipulation of cloned herpesvirus sequences, the generation of
recombinant virus following the transfection of multiple plasmids containing different segments of the
large herpesvirus genomes, the growth and propagation of herpesvirus, and the infection of cells with
herpesvirus are techniques well known to those of ordinary skill in the art.

In another alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to 
deliver polynucleotides encoding PPIM to target cells. The biology of the prototypic alphavirus, Semliki 
Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based on the SFV
genome (Garoff, H. and K.-J. Li (1998) Curr. Opin. Biotechnol. 9:464-469). During alphavirus RNA
replication, a subgenomic RNA is generated that normally encodes the viral capsid proteins. This
subgenomic RNA replicates to higher levels than the full-length genomic RNA, resulting in the
overproduction of capsid proteins relative to the viral proteins with enzymatic activity (e.g., protease and
polymerase). Similarly, inserting the coding sequence for PPIM into the alphavirus genome in place of
the capsid-coding region results in the production of a large number of PPIM-coding RNAs and the
synthesis of high levels of PPIM in vector transduced cells. While alphavirus infection is typically
associated with cell lysis within a few days, the ability to establish a persistent infection in hamster
normal kidney cells (BHK-21) with a variant of Sindbis virus (SIN) indicates that the lytic replication of
alphaviruses can be altered to suit the needs of the gene therapy application (Dryga, S.A. et al. (1997)
Virology 228:74-83). The wide host range of alphaviruses will allow the introduction of PPIM into a
variety of cell types. The specific transduction of a subset of cells in a population may require the
sorting of cells prior to transduction. The methods of manipulating infectious cDNA clones of
alphaviruses, performing alphavirus cDNA and RNA transfections, and performing alphavirus 
infections, are well known to those with ordinary skill in the art. 

Oligonucleotides derived from the transcription initiation site, e.g., between about positions -10
and +10 from the start site, may also be employed to inhibit gene expression. Similarly, inhibition can be
achieved using triple helix base-pairing methodology. Triple helix pairing is useful because it causes
inhibition of the ability of the double helix to open sufficiently for the binding of polymerases,
transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have
been described in the literature. (See, e.g., Gee, J.E. et al. (1994) in Huber, B.E. and B.I. Carr,
Molecular and Immunologic Approaches, Futura Publishing, Mt. Kisco NY, pp. 163-177.) A
complementary sequence or antisense molecule may also be designed to block translation of mRNA by
preventing the transcript from binding to ribosomes.
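
By way of illustration only, the design of an oligonucleotide complementary to the region between about positions -10 and +10 of a start site can be sketched as follows. The example sequence, the function names, and the assumed location of the +1 position are hypothetical and are not taken from any sequence of the invention.

```python
# Hypothetical illustration: derive an antisense DNA oligonucleotide
# complementary to a window of a target sequence around a start site.

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return "".join(COMPLEMENT[base] for base in reversed(dna.upper()))


def antisense_window(dna: str, start_index: int,
                     upstream: int = 10, downstream: int = 10) -> str:
    """Antisense oligo covering positions -upstream to +downstream.

    start_index is the 0-based index of the +1 position. Because there is
    no position 0, the window is dna[start_index - upstream :
    start_index + downstream].
    """
    lo = start_index - upstream
    hi = start_index + downstream
    if lo < 0 or hi > len(dna):
        raise ValueError("window extends beyond the sequence")
    return reverse_complement(dna[lo:hi])


# Made-up 30-nucleotide sequence; the +1 position is assumed to be index 12.
example = "GCGCTATAAAGGCAGTCCATGGCTAACGTT"
oligo = antisense_window(example, start_index=12)
print(oligo)
```

The sketch covers base complementarity only. Triple helix formation and the accessibility of the target in a cell are not modeled.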

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of
RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme
molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, engineered
hammerhead motif ribozyme molecules may specifically and efficiently catalyze endonucleolytic
cleavage of sequences encoding PPIM.

Specific ribozyme cleavage sites within any potential RNA target are initially identified by
scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, GUU,
and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, corresponding
to the region of the target gene containing the cleavage site, may be evaluated for secondary structural
features which may render the oligonucleotide inoperable. The suitability of candidate targets may also
be evaluated by testing accessibility to hybridization with complementary oligonucleotides using
ribonuclease protection assays.
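
The initial scanning step described above may be sketched, purely as an example, in a few lines of Python. The target sequence, the function names, and the default region length of 17 nucleotides are assumptions made for illustration. The evaluation of secondary structure and the ribonuclease protection assay are not modeled.

```python
# Hypothetical illustration: locate GUA, GUU, and GUC triplets in an RNA
# target and extract a short candidate region around each site.

CLEAVAGE_TRIPLETS = ("GUA", "GUU", "GUC")


def find_cleavage_sites(rna: str) -> list[tuple[int, str]]:
    """Return (0-based index, triplet) for every GUA/GUU/GUC in the RNA."""
    rna = rna.upper()
    return [(i, rna[i:i + 3])
            for i in range(len(rna) - 2)
            if rna[i:i + 3] in CLEAVAGE_TRIPLETS]


def candidate_region(rna: str, site: int, length: int = 17) -> str:
    """Return a region of 15 to 20 nt that contains the triplet at `site`.

    The region is roughly centered on the triplet and is shifted inward
    when the site lies near either end of the target.
    """
    if not 15 <= length <= 20:
        raise ValueError("length must be between 15 and 20 nucleotides")
    if length > len(rna):
        raise ValueError("target is shorter than the requested region")
    start = max(0, min(site - (length - 3) // 2, len(rna) - length))
    return rna[start:start + length].upper()


# Made-up target sequence, for demonstration only.
target = "AUGGCUGUCAAGCCGUUAGGCAUCGGUACCGAUUGA"
sites = find_cleavage_sites(target)
regions = [candidate_region(target, index) for index, _ in sites]
for (index, triplet), region in zip(sites, regions):
    print(index, triplet, region)
```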

Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared by 
any method known in the art for the synthesis of nucleic acid molecules. These include techniques for 
chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. 
Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA sequences
encoding PPIM. Such DNA sequences may be incorporated into a wide variety of vectors with suitable
RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs that synthesize 
complementary RNA, constitutively or inducibly, can be introduced into cell lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' ends of
the molecule, or the use of phosphorothioate or 2' O-methyl rather than phosphodiester linkages
within the backbone of the molecule. This concept is inherent in the production of PNAs and can be
extended in all of these molecules by the inclusion of nontraditional bases such as inosine, queuosine, and
wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, guanine,
thymine, and uridine which are not as easily recognized by endogenous endonucleases.

An additional embodiment of the invention encompasses a method for screening for a
compound which is effective in altering expression of a polynucleotide encoding PPIM. Compounds
which may be effective in altering expression of a specific polynucleotide may include, but are not
limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming oligonucleotides,
transcription factors and other polypeptide transcriptional regulators, and non-macromolecular
chemical entities which are capable of interacting with specific polynucleotide sequences. Effective
compounds may alter polynucleotide expression by acting as either inhibitors or promoters of
polynucleotide expression. Thus, in the treatment of disorders associated with increased PPIM
expression or activity, a compound which specifically inhibits expression of the polynucleotide
encoding PPIM may be therapeutically useful, and in the treatment of disorders associated with
decreased PPIM expression or activity, a compound which specifically promotes expression of the
polynucleotide encoding PPIM may be therapeutically useful.

At least one, and up to a plurality, of test compounds may be screened for effectiveness in 
altering expression of a specific polynucleotide. A test compound may be obtained by any method 
commonly known in the art, including chemical modification of a compound known to be effective in 
altering polynucleotide expression; selection from an existing, commercially-available or proprietary
library of naturally-occurring or non-natural chemical compounds; rational design of a compound
based on chemical and/or structural properties of the target polynucleotide; and selection from a library
of chemical compounds created combinatorially or randomly. A sample comprising a polynucleotide
encoding PPIM is exposed to at least one test compound thus obtained. The sample may comprise, for
example, an intact or permeabilized cell, or an in vitro cell-free or reconstituted biochemical system.
Alterations in the expression of a polynucleotide encoding PPIM are assayed by any method
commonly known in the art. Typically, the expression of a specific nucleotide is detected by
hybridization with a probe having a nucleotide sequence complementary to the sequence of the
polynucleotide encoding PPIM. The amount of hybridization may be quantified, thus forming the
basis for a comparison of the expression of the polynucleotide both with and without exposure to one
or more test compounds. Detection of a change in the expression of a polynucleotide exposed to a test
compound indicates that the test compound is effective in altering the expression of the polynucleotide.
A screen for a compound effective in altering expression of a specific polynucleotide can be carried
out, for example, using a Schizosaccharomyces pombe gene expression system (Atkins, D. et al.
(1999) U.S. Patent No. 5,932,435; Arndt, G.M. et al. (2000) Nucleic Acids Res. 28:E15) or a human
cell line such as HeLa cell (Clarke, M.L. et al. (2000) Biochem. Biophys. Res. Commun. 268:8-13). A
particular embodiment of the present invention involves screening a combinatorial library of 
oligonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide nucleic acids, and modified 
oligonucleotides) for antisense activity against a specific polynucleotide sequence (Bruice, T.W. et al.
(1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) U.S. Patent No. 6,022,691).
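
The comparison of quantified hybridization with and without exposure to a test compound may be illustrated by the following minimal sketch. The two-fold threshold, the signal values, and the function names are hypothetical; the specification does not prescribe any particular cutoff or statistical test.

```python
# Hypothetical illustration: compare hybridization signal measured with
# and without a test compound and label the apparent effect.

def fold_change(treated: float, untreated: float) -> float:
    """Ratio of signal with the compound to signal without it."""
    if treated < 0 or untreated <= 0:
        raise ValueError("signals must be non-negative and the control positive")
    return treated / untreated


def classify_compound(treated: float, untreated: float,
                      threshold: float = 2.0) -> str:
    """Label a compound using an assumed fold-change threshold."""
    if threshold <= 1.0:
        raise ValueError("threshold must be greater than 1")
    ratio = fold_change(treated, untreated)
    if ratio >= threshold:
        return "promotes expression"
    if ratio <= 1.0 / threshold:
        return "inhibits expression"
    return "no clear effect"


# Made-up signal values in arbitrary units.
print(classify_compound(300.0, 100.0))
print(classify_compound(40.0, 100.0))
print(classify_compound(110.0, 100.0))
```

In practice, replicate measurements and a suitable statistical test would be used in place of a fixed threshold.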

Many methods for introducing vectors into cells or tissues are available and equally suitable for
use in vivo, in vitro, and ex vivo. For ex vivo therapy, vectors may be introduced into stem cells taken
from the patient and clonally propagated for autologous transplant back into that same patient. Delivery
by transfection, by liposome injections, or by polycationic amino polymers may be achieved using
methods which are well known in the art. (See, e.g., Goldman, C.K. et al. (1997) Nat. Biotechnol.
15:462-466.)

Any of the therapeutic methods described above may be applied to any subject in need of such 
therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
monkeys. 

An additional embodiment of the invention relates to the administration of a composition which
generally comprises an active ingredient formulated with a pharmaceutically acceptable excipient.
Excipients may include, for example, sugars, starches, celluloses, gums, and proteins. Various
formulations are commonly known and are thoroughly discussed in the latest edition of Remington's
Pharmaceutical Sciences (Mack Publishing, Easton PA). Such compositions may consist of PPIM,
antibodies to PPIM, and mimetics, agonists, antagonists, or inhibitors of PPIM.

The compositions utilized in this invention may be administered by any number of routes 
including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, 
intraventricular, pulmonary, transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, 
sublingual, or rectal means. 

Compositions for pulmonary administration may be prepared in liquid or dry powder form.
These compositions are generally aerosolized immediately prior to inhalation by the patient. In the case
of small molecules (e.g. traditional low molecular weight organic drugs), aerosol delivery of fast-acting
formulations is well-known in the art. In the case of macromolecules (e.g. larger peptides and proteins),
recent developments in the field of pulmonary delivery via the alveolar region of the lung have enabled
the practical delivery of drugs such as insulin to blood circulation (see, e.g., Patton, J.S. et al., U.S.
Patent No. 5,997,848). Pulmonary delivery has the advantage of administration without needle injection,
and obviates the need for potentially toxic penetration enhancers.

Compositions suitable for use in the invention include compositions wherein the active 
ingredients are contained in an effective amount to achieve the intended purpose. The determination of
an effective dose is well within the capability of those skilled in the art.

Specialized forms of compositions may be prepared for direct intracellular delivery of 
macromolecules comprising PPIM or fragments thereof. For example, liposome preparations containing 
a cell-impermeable macromolecule may promote cell fusion and intracellular delivery of the 
macromolecule. Alternatively, PPIM or a fragment thereof may be joined to a short cationic N-terminal
portion from the HIV Tat-1 protein. Fusion proteins thus generated have been found to transduce into
the cells of all tissues, including the brain, in a mouse model system (Schwarze, S.R. et al. (1999)
Science 285:1569-1572).

For any compound, the therapeutically effective dose can be estimated initially either in cell 
culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, monkeys,
or pigs. An animal model may also be used to determine the appropriate concentration range and route
of administration. Such information can then be used to determine useful doses and routes for
administration in humans.

A therapeutically effective dose refers to that amount of active ingredient, for example PPIM or 
fragments thereof, antibodies to PPIM, and agonists, antagonists or inhibitors of PPIM, which
ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be determined by
standard pharmaceutical procedures in cell cultures or with experimental animals, such as by calculating
the ED50 (the dose therapeutically effective in 50% of the population) or LD50 (the dose lethal to 50% of
the population) statistics. The dose ratio of toxic to therapeutic effects is the therapeutic index, which
can be expressed as the LD50/ED50 ratio. Compositions which exhibit large therapeutic indices are
preferred. The data obtained from cell culture assays and animal studies are used to formulate a range of
dosage for human use. The dosage contained in such compositions is preferably within a range of
circulating concentrations that includes the ED50 with little or no toxicity. The dosage varies within this
range depending upon the dosage form employed, the sensitivity of the patient, and the route of
administration.
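
As a simple worked example of the LD50/ED50 ratio described above, the following sketch computes a therapeutic index from two dose values. The numerical values and the function name are hypothetical and do not represent measured data for any composition.

```python
# Hypothetical illustration: therapeutic index as the LD50/ED50 ratio.

def therapeutic_index(ld50: float, ed50: float) -> float:
    """Return LD50 divided by ED50; both doses must share the same units."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50


# Made-up doses, both assumed to be expressed in mg/kg.
index = therapeutic_index(ld50=500.0, ed50=25.0)
print(index)
```

A larger value indicates a wider margin between the dose that is effective and the dose that is lethal in half of the test population.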

The exact dosage will be determined by the practitioner, in light of factors related to the subject
requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the active
moiety or to maintain the desired effect. Factors which may be taken into account include the severity of
the disease state, the general health of the subject, the age, weight, and gender of the subject, time and
frequency of administration, drug combination(s), reaction sensitivities, and response to therapy.
Long-acting compositions may be administered every 3 to 4 days, every week, or biweekly depending on the
half-life and clearance rate of the particular formulation.

Normal dosage amounts may vary from about 0.1 μg to 100,000 μg, up to a total dose of about
1 gram, depending upon the route of administration. Guidance as to particular dosages and methods of
delivery is provided in the literature and generally available to practitioners in the art. Those skilled in
the art will employ different formulations for nucleotides than for proteins or their inhibitors. Similarly,
delivery of polynucleotides or polypeptides will be specific to particular cells, conditions, locations, etc.

DIAGNOSTICS

In another embodiment, antibodies which specifically bind PPIM may be used for the diagnosis 
of disorders characterized by expression of PPIM, or in assays to monitor patients being treated with
PPIM or agonists, antagonists, or inhibitors of PPIM. Antibodies useful for diagnostic purposes may be
prepared in the same manner as described above for therapeutics. Diagnostic assays for PPIM include
methods which utilize the antibody and a label to detect PPIM in human body fluids or in extracts of
cells or tissues. The antibodies may be used with or without modification, and may be labeled by
covalent or non-covalent attachment of a reporter molecule. A wide variety of reporter molecules, several
of which are described above, are known in the art and may be used.

A variety of protocols for measuring PPIM, including ELISAs, RIAs, and FACS, are known in 
the art and provide a basis for diagnosing altered or abnormal levels of PPIM expression. Normal or 
standard values for PPIM expression are established by combining body fluids or cell extracts taken 
from normal mammalian subjects, for example, human subjects, with antibody to PPIM under conditions
suitable for complex formation. The amount of standard complex formation may be quantitated by
various methods, such as photometric means. Quantities of PPIM expressed in subject, control, and
disease samples from biopsied tissues are compared with the standard values. Deviation between
standard and subject values establishes the parameters for diagnosing disease.

In another embodiment of the invention, the polynucleotides encoding PPIM may be used for
diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences,
complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect and 
quantify gene expression in biopsied tissues in which expression of PPIM may be correlated with 
disease. The diagnostic assay may be used to determine absence, presence, and excess expression of 
PPIM, and to monitor regulation of PPIM levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide
sequences, including genomic sequences, encoding PPIM or closely related molecules may be used to
identify nucleic acid sequences which encode PPIM. The specificity of the probe, whether it is made
from a highly specific region, e.g., the 5' regulatory region, or from a less specific region, e.g., a
conserved motif, and the stringency of the hybridization or amplification will determine whether the
probe identifies only naturally occurring sequences encoding PPIM, allelic variants, or related sequences.
Probes may also be used for the detection of related sequences, and may have at least 50% 
sequence identity to any of the PPIM encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:28-54 or from 
genomic sequences including promoters, enhancers, and introns of the PPIM gene. 
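
To make the notion of percent sequence identity concrete, the following sketch counts matching positions between a probe and a target. It assumes, for simplicity, that the two sequences are already aligned, ungapped, and of equal length; in practice identity is determined over an alignment produced by a dedicated alignment program. The sequences and the function name are hypothetical.

```python
# Hypothetical illustration: percent identity between two pre-aligned,
# ungapped sequences of equal length.

def percent_identity(a: str, b: str) -> float:
    """Percentage of positions at which the two sequences agree."""
    if not a or len(a) != len(b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a.upper(), b.upper()))
    return 100.0 * matches / len(a)


# Made-up 10-nucleotide sequences that differ at two positions.
probe = "ATGGCTAAGC"
target = "ATGGATACGC"
identity = percent_identity(probe, target)
print(identity)
print(identity >= 50.0)
```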

Means for producing specific hybridization probes for DNAs encoding PPIM include the cloning
of polynucleotide sequences encoding PPIM or PPIM derivatives into vectors for the production of
mRNA probes. Such vectors are known in the art, are commercially available, and may be used to
synthesize RNA probes in vitro by means of the addition of the appropriate RNA polymerases and the
appropriate labeled nucleotides. Hybridization probes may be labeled by a variety of reporter groups,
for example, by radionuclides such as 32P or 35S, or by enzymatic labels, such as alkaline phosphatase
coupled to the probe via avidin/biotin coupling systems, and the like.

Polynucleotide sequences encoding PPIM may be used for the diagnosis of disorders associated 
with expression of PPIM. Examples of such disorders include, but are not limited to, a cell proliferative 
disorder, such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed
connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria,
polycythemia vera, psoriasis, primary thrombocythemia, and cancers including adenocarcinoma,
leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of
the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder, ganglia,
gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate,
salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; and an autoimmune/inflammatory
disorder such as acquired immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory
distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, atherosclerosis,
autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune polyendocrinopathy-candidiasis-ectodermal
dystrophy (APECED), bronchitis, cholecystitis, contact dermatitis, Crohn's disease, atopic
dermatitis, dermatomyositis, diabetes mellitus, emphysema, episodic lymphopenia with
lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis,
Goodpasture's syndrome, gout, Graves' disease, Hashimoto's thyroiditis, hypereosinophilia, irritable
bowel syndrome, multiple sclerosis, myasthenia gravis, myocardial or pericardial inflammation,
osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter's syndrome, rheumatoid
arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus,
systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, 
complications of cancer, hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, parasitic, 
protozoal, and helminthic infections, and trauma. The polynucleotide sequences encoding PPIM may be 
used in Southern or northern analysis, dot blot, or other membrane-based technologies; in PCR
technologies; in dipstick, pin, and multiformat ELISA-like assays; and in microarrays utilizing fluids or
tissues from patients to detect altered PPIM expression. Such qualitative or quantitative methods are 
well known in the art. 

In a particular aspect, the nucleotide sequences encoding PPIM may be useful in assays that 
detect the presence of associated disorders, particularly those mentioned above. The nucleotide
sequences encoding PPIM may be labeled by standard methods and added to a fluid or tissue sample
from a patient under conditions suitable for the formation of hybridization complexes. After a suitable
incubation period, the sample is washed and the signal is quantified and compared with a standard value.
If the amount of signal in the patient sample is significantly altered in comparison to a control sample
then the presence of altered levels of nucleotide sequences encoding PPIM in the sample indicates the
presence of the associated disorder. Such assays may also be used to evaluate the efficacy of a
particular therapeutic treatment regimen in animal studies, in clinical trials, or to monitor the treatment
of an individual patient.

In order to provide a basis for the diagnosis of a disorder associated with expression of PPIM, a 
normal or standard profile for expression is established. This may be accomplished by combining body
fluids or cell extracts taken from normal subjects, either animal or human, with a sequence, or a
fragment thereof, encoding PPIM, under conditions suitable for hybridization or amplification. Standard
hybridization may be quantified by comparing the values obtained from normal subjects with values
from an experiment in which a known amount of a substantially purified polynucleotide is used.
Standard values obtained in this manner may be compared with values obtained from samples from
patients who are symptomatic for a disorder. Deviation from standard values is used to establish the
presence of a disorder.

Once the presence of a disorder is established and a treatment protocol is initiated, hybridization 
assays may be repeated on a regular basis to determine if the level of expression in the patient begins to 
approximate that which is observed in the normal subject. The results obtained from successive assays 
may be used to show the efficacy of treatment over a period ranging from several days to months.

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the development of 
the disease, or may provide a means for detecting the disease prior to the appearance of actual clinical 
symptoms. A more definitive diagnosis of this type may allow health professionals to employ
preventative measures or aggressive treatment earlier thereby preventing the development or further
progression of the cancer.

Additional diagnostic uses for oligonucleotides designed from the sequences encoding PPIM may 
involve the use of PCR. These oligomers may be chemically synthesized, generated enzymatically, or
produced in vitro. Oligomers will preferably contain a fragment of a polynucleotide encoding PPIM, or
a fragment of a polynucleotide complementary to the polynucleotide encoding PPIM, and will be
employed under optimized conditions for identification of a specific gene or condition. Oligomers may
also be employed under less stringent conditions for detection or quantification of closely related DNA or
RNA sequences.

In a particular aspect, oligonucleotide primers derived from the polynucleotide sequences
encoding PPIM may be used to detect single nucleotide polymorphisms (SNPs). SNPs are substitutions,
insertions and deletions that are a frequent cause of inherited or acquired genetic disease in humans.
Methods of SNP detection include, but are not limited to, single-stranded conformation polymorphism
(SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers derived from the
polynucleotide sequences encoding PPIM are used to amplify DNA using the polymerase chain reaction
(PCR). The DNA may be derived, for example, from diseased or normal tissue, biopsy samples, bodily
fluids, and the like. SNPs in the DNA cause differences in the secondary and tertiary structures of PCR
products in single-stranded form, and these differences are detectable using gel electrophoresis in
non-denaturing gels. In fSSCP, the oligonucleotide primers are fluorescently labeled, which allows detection
of the amplimers in high-throughput equipment such as DNA sequencing machines. Additionally,
sequence database analysis methods, termed in silico SNP (isSNP), are capable of identifying
polymorphisms by comparing the sequence of individual overlapping DNA fragments which assemble
into a common consensus sequence. These computer-based methods filter out sequence variations due to
laboratory preparation of DNA and sequencing errors using statistical models and automated analyses of
DNA sequence chromatograms. In the alternative, SNPs may be detected and characterized by mass
spectrometry using, for example, the high throughput MASSARRAY system (Sequenom, Inc., San
Diego CA).
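As a rough illustration of the in silico comparison described above, the following Python sketch scans a set of overlapping reads, already placed on a common consensus coordinate system, and reports positions where a non-consensus base recurs in more than one read. The function name, the gap character, and the minimum-count cutoff are assumptions made for illustration only; the statistical models and chromatogram analyses mentioned in the text are not reproduced here.

```python
from collections import Counter

def candidate_snps(aligned_reads, min_minor_count=2):
    """Return (position, consensus_base, {minor_base: count}) for columns
    where a non-consensus base is seen in at least min_minor_count reads.

    aligned_reads: equal-length strings already placed on the consensus
    coordinate system; '-' marks positions a read does not cover.
    """
    if not aligned_reads:
        return []
    length = len(aligned_reads[0])
    if any(len(r) != length for r in aligned_reads):
        raise ValueError("reads must be padded to the same length")
    hits = []
    for pos in range(length):
        # Count the bases observed at this column, ignoring uncovered positions.
        counts = Counter(r[pos] for r in aligned_reads if r[pos] != "-")
        if len(counts) < 2:
            continue
        consensus, _ = counts.most_common(1)[0]
        # A base seen only once is treated here as a possible sequencing error.
        minors = {b: n for b, n in counts.items()
                  if b != consensus and n >= min_minor_count}
        if minors:
            hits.append((pos, consensus, minors))
    return hits

# Hypothetical overlapping reads (not taken from the Sequence Listing)
reads = [
    "ACGTACGT--",
    "ACGTACGTAC",
    "--GTGCGTAC",
    "--GTGCGTAC",
    "ACGTACGTAC",
]
print(candidate_snps(reads))
```

Requiring the minor base to appear in at least two reads is a crude stand-in for the error filtering described in the text; a real pipeline would weigh base-call quality as well.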

Methods which may also be used to quantify the expression of PPIM include radiolabeling or 
biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 
standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. et 
al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be
accelerated by running the assay in a high-throughput format where the oligomer or polynucleotide of 
interest is presented in various dilutions and a spectrophotometric or colorimetric response gives rapid 
quantitation. 
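The interpolation of results from a standard curve can be sketched as follows. This is a minimal Python example that assumes a piecewise-linear curve and a signal that increases with the amount of target; the function name and the dilution values are hypothetical and are not taken from the cited references.

```python
from bisect import bisect_left

def interpolate_amount(signal, standards):
    """Estimate the amount of target from a measured signal by linear
    interpolation between the two bracketing points of a standard curve.

    standards: list of (known_amount, measured_signal) pairs; signals are
    assumed to increase with amount over the range of the curve.
    """
    pts = sorted(standards, key=lambda p: p[1])
    signals = [s for _, s in pts]
    if not signals[0] <= signal <= signals[-1]:
        raise ValueError("signal lies outside the standard curve")
    i = bisect_left(signals, signal)
    if signals[i] == signal:
        return pts[i][0]
    (a0, s0), (a1, s1) = pts[i - 1], pts[i]
    return a0 + (a1 - a0) * (signal - s0) / (s1 - s0)

# Hypothetical dilution series: (amount in ng, absorbance reading)
curve = [(0.0, 0.02), (1.0, 0.12), (2.0, 0.22), (4.0, 0.42)]
print(interpolate_amount(0.32, curve))
```

Signals outside the range of the standards raise an error in this sketch, since extrapolating beyond the curve is not described in the text.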

In further embodiments, oligonucleotides or longer fragments derived from any of the
polynucleotide sequences described herein may be used as elements on a microarray. The microarray
can be used in transcript imaging techniques which monitor the relative expression levels of large
numbers of genes simultaneously as described in Seilhamer, J.J. et al., "Comparative Gene Transcript
Analysis," U.S. Patent No. 5,840,484, incorporated herein by reference. The microarray may also be
used to identify genetic variants, mutations, and polymorphisms. This information may be used to
determine gene function, to understand the genetic basis of a disorder, to diagnose a disorder, to monitor
progression/regression of disease as a function of gene expression, and to develop and monitor the
activities of therapeutic agents in the treatment of disease. In particular, this information may be used to
develop a pharmacogenomic profile of a patient in order to select the most appropriate and effective
treatment regimen for that patient. For example, therapeutic agents which are highly effective and
display the fewest side effects may be selected for a patient based on his/her pharmacogenomic profile.

In another embodiment, antibodies specific for PPIM, or PPIM or fragments thereof may be 
used as elements on a microarray. The microarray may be used to monitor or measure protein-protein 
interactions, drug-target interactions, and gene expression profiles, as described above. 

A particular embodiment relates to the use of the polynucleotides of the present invention to
generate a transcript image of a tissue or cell type. A transcript image represents the global pattern of
gene expression by a particular tissue or cell type. Global gene expression patterns are analyzed by
quantifying the number of expressed genes and their relative abundance under given conditions and at a
given time. (See Seilhamer et al., "Comparative Gene Transcript Analysis," U.S. Patent Number
5,840,484, expressly incorporated by reference herein.) Thus a transcript image may be generated by
hybridizing the polynucleotides of the present invention or their complements to the totality of transcripts
or reverse transcripts of a particular tissue or cell type. In one embodiment, the hybridization takes place
in high-throughput format, wherein the polynucleotides of the present invention or their complements
comprise a subset of a plurality of elements on a microarray. The resultant transcript image would
provide a profile of gene activity.

Transcript images may be generated using transcripts isolated from tissues, cell lines, biopsies,
or other biological samples. The transcript image may thus reflect gene expression in vivo, as in the case
of a tissue or biopsy sample, or in vitro, as in the case of a cell line.

Transcript images which profile the expression of the polynucleotides of the present invention
may also be used in conjunction with in vitro model systems and preclinical evaluation of
pharmaceuticals, as well as toxicological testing of industrial and naturally-occurring environmental
compounds. All compounds induce characteristic gene expression patterns, frequently termed molecular
fingerprints or toxicant signatures, which are indicative of mechanisms of action and toxicity (Nuwaysir,
E.F. et al. (1999) Mol. Carcinog. 24:153-159; Steiner, S. and N.L. Anderson (2000) Toxicol. Lett.
112-113:467-471, expressly incorporated by reference herein). If a test compound has a signature similar to
that of a compound with known toxicity, it is likely to share those toxic properties. These fingerprints or
signatures are most useful and refined when they contain expression information from a large number of
genes and gene families. Ideally, a genome-wide measurement of expression provides the highest quality
signature. Genes whose expression is not altered by any tested compound are also important, as
the levels of expression of these genes are used to normalize the rest of the expression data. The
normalization procedure is useful for comparison of expression data after treatment with different
compounds. While the assignment of gene function to elements of a toxicant signature aids in
interpretation of toxicity mechanisms, knowledge of gene function is not necessary for the statistical
matching of signatures which leads to prediction of toxicity. (See, for example, Press Release 00-02
from the National Institute of Environmental Health Sciences, released February 29, 2000, available at
http://www.niehs.nih.gov/oc/news/toxchip.htm.) Therefore, it is important and desirable in toxicological
screening using toxicant signatures to include all expressed gene sequences.
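One simple way to picture the statistical matching of signatures is to correlate the expression profile of a test compound with stored profiles of compounds of known toxicity. The Python sketch below uses the Pearson correlation coefficient for this purpose; the choice of Pearson correlation, the compound names, and the expression values are all assumptions for illustration and are not specified in the text.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two equal-length sequences of length >= 2")
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("a constant signature has no defined correlation")
    return sxy / math.sqrt(sxx * syy)

def closest_signature(test, references):
    """Return (name, r) of the reference signature best correlated with test."""
    return max(((name, pearson(test, sig)) for name, sig in references.items()),
               key=lambda item: item[1])

# Hypothetical log2 expression ratios (treated / untreated) for five genes
references = {
    "known_toxicant_A": [2.0, 1.5, -1.0, 0.1, 0.0],
    "known_toxicant_B": [-1.0, 0.2, 2.0, 1.8, 0.1],
}
test_compound = [1.8, 1.2, -0.8, 0.0, 0.1]
name, r = closest_signature(test_compound, references)
print(name, round(r, 3))
```

Note that the matching uses only the numerical profiles, which is consistent with the statement above that knowledge of gene function is not needed for signature matching.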

In one embodiment, the toxicity of a test compound is assessed by treating a biological sample
containing nucleic acids with the test compound. Nucleic acids that are expressed in the treated
biological sample are hybridized with one or more probes specific to the polynucleotides of the present
invention, so that transcript levels corresponding to the polynucleotides of the present invention may
be quantified. The transcript levels in the treated biological sample are compared with levels in an
untreated biological sample. Differences in the transcript levels between the two samples are
indicative of a toxic response caused by the test compound in the treated sample.
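The comparison of treated and untreated transcript levels can be sketched as below. The example first scales each sample by the mean signal of genes assumed to be unaffected by treatment, in the spirit of the normalization described earlier, and then reports log2 ratios. The gene names, signal values, and the twofold cutoff are hypothetical; the text does not specify a threshold.

```python
import math

def log2_ratios(treated, untreated, reference_genes):
    """Scale each sample by the mean level of its reference genes, then
    return the log2(treated / untreated) ratio for every other gene."""
    def scale(sample):
        ref = sum(sample[g] for g in reference_genes) / len(reference_genes)
        return {g: v / ref for g, v in sample.items()}
    t, u = scale(treated), scale(untreated)
    return {g: math.log2(t[g] / u[g]) for g in t if g not in reference_genes}

def flag_changes(ratios, threshold=1.0):
    """Genes whose absolute log2 ratio meets the (assumed) threshold."""
    return sorted(g for g, r in ratios.items() if abs(r) >= threshold)

# Hypothetical hybridization signals (arbitrary units)
untreated = {"ref1": 100.0, "ref2": 200.0, "geneA": 50.0, "geneB": 80.0}
treated = {"ref1": 200.0, "ref2": 400.0, "geneA": 400.0, "geneB": 160.0}
ratios = log2_ratios(treated, untreated, ["ref1", "ref2"])
print(ratios)
print(flag_changes(ratios))
```

In this made-up data set the treated sample is twice as bright overall, so geneB, whose raw signal doubles, shows no change after normalization, while geneA is flagged.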

Another particular embodiment relates to the use of the polypeptide sequences of the present
invention to analyze the proteome of a tissue or cell type. The term proteome refers to the global pattern
of protein expression in a particular tissue or cell type. Each protein component of a proteome can be
subjected individually to further analysis. Proteome expression patterns, or profiles, are analyzed by
quantifying the number of expressed proteins and their relative abundance under given conditions and at
a given time. A profile of a cell's proteome may thus be generated by separating and analyzing the
polypeptides of a particular tissue or cell type. In one embodiment, the separation is achieved using
two-dimensional gel electrophoresis, in which proteins from a sample are separated by isoelectric focusing in
the first dimension, and then according to molecular weight by sodium dodecyl sulfate slab gel
electrophoresis in the second dimension (Steiner and Anderson, supra). The proteins are visualized in the
gel as discrete and uniquely positioned spots, typically by staining the gel with an agent such as
Coomassie Blue or silver or fluorescent stains. The optical density of each protein spot is generally
proportional to the level of the protein in the sample. The optical densities of equivalently positioned
protein spots from different samples, for example, from biological samples either treated or untreated
with a test compound or therapeutic agent, are compared to identify any changes in protein spot density
related to the treatment. The proteins in the spots are partially sequenced using, for example, standard
methods employing chemical or enzymatic cleavage followed by mass spectrometry. The identity of the
protein in a spot may be determined by comparing its partial sequence, preferably of at least 5
contiguous amino acid residues, to the polypeptide sequences of the present invention. In some cases,
further sequence data may be obtained for definitive protein identification.
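The comparison of a partial sequence with known polypeptide sequences amounts, in its simplest form, to an exact substring search. The Python sketch below assumes exact matching and uses invented sequences; the function name and the example entries are hypothetical and do not correspond to any SEQ ID NO.

```python
def match_partial_sequence(partial, polypeptides, min_length=5):
    """Return the names of polypeptides that contain `partial` as a
    contiguous run, provided it is at least min_length residues long."""
    partial = partial.upper()
    if len(partial) < min_length:
        raise ValueError(f"partial sequence shorter than {min_length} residues")
    return [name for name, seq in polypeptides.items() if partial in seq.upper()]

# Hypothetical sequences in one-letter amino acid code
polypeptides = {
    "example_1": "MKTAYIAKQRQISFVKSHFSRQ",
    "example_2": "MSDNGELEDKPPAPPVRMSSTIF",
}
print(match_partial_sequence("IAKQRQ", polypeptides))
```

A short peptide can occur in more than one protein, which is one reason the text notes that further sequence data may be needed for a definitive identification.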

A proteomic profile may also be generated using antibodies specific for PPIM to quantify the
levels of PPIM expression. In one embodiment, the antibodies are used as elements on a microarray, and
protein expression levels are quantified by exposing the microarray to the sample and detecting the levels
of protein bound to each array element (Lueking, A. et al. (1999) Anal. Biochem. 270:103-111;
Mendoza, L.G. et al. (1999) Biotechniques 27:778-788). Detection may be performed by a variety of
methods known in the art, for example, by reacting the proteins in the sample with a thiol- or
amino-reactive fluorescent compound and detecting the amount of fluorescence bound at each array element.

Toxicant signatures at the proteome level are also useful for toxicological screening, and should
be analyzed in parallel with toxicant signatures at the transcript level. There is a poor correlation
between transcript and protein abundances for some proteins in some tissues (Anderson, N.L. and J.
Seilhamer (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be useful in the
analysis of compounds which do not significantly affect the transcript image, but which alter the
proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, due to rapid
degradation of mRNA, so proteomic profiling may be more reliable and informative in such cases.

In another embodiment, the toxicity of a test compound is assessed by treating a biological
sample containing proteins with the test compound. Proteins that are expressed in the treated biological
sample are separated so that the amount of each protein can be quantified. The amount of each protein is
compared to the amount of the corresponding protein in an untreated biological sample. A difference in
the amount of protein between the two samples is indicative of a toxic response to the test compound in
the treated sample. Individual proteins are identified by sequencing the amino acid residues of the
individual proteins and comparing these partial sequences to the polypeptides of the present invention.

In another embodiment, the toxicity of a test compound is assessed by treating a biological
sample containing proteins with the test compound. Proteins from the biological sample are incubated
with antibodies specific to the polypeptides of the present invention. The amount of protein recognized
by the antibodies is quantified. The amount of protein in the treated biological sample is compared with
the amount in an untreated biological sample. A difference in the amount of protein between the two
samples is indicative of a toxic response to the test compound in the treated sample.

Microarrays may be prepared, used, and analyzed using methods known in the art. (See, e.g.,
Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci.
USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application WO95/251116; Shalon, D. et al.
(1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci. USA
94:2150-2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) Various types of microarrays are well
known and thoroughly described in DNA Microarrays: A Practical Approach, M. Schena, ed. (1999)
Oxford University Press, London, hereby expressly incorporated by reference.

In another embodiment of the invention, nucleic acid sequences encoding PPIM may be used to
generate hybridization probes useful in mapping the naturally occurring genomic sequence. Either
coding or noncoding sequences may be used, and in some instances, noncoding sequences may be
preferable over coding sequences. For example, conservation of a coding sequence among members of
a multi-gene family may potentially cause undesired cross hybridization during chromosomal
mapping. The sequences may be mapped to a particular chromosome, to a specific region of a
chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes (HACs),
yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial P1
constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J.J. et al. (1997) Nat.
Genet. 15:345-355; Price, C.M. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet.
7:149-154.) Once mapped, the nucleic acid sequences of the invention may be used to develop genetic
linkage maps, for example, which correlate the inheritance of a disease state with the inheritance of a
particular chromosome region or restriction fragment length polymorphism (RFLP). (See, e.g.,
Lander, E.S. and D. Botstein (1986) Proc. Natl. Acad. Sci. USA 83:7353-7357.)

Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic map
data. (See, e.g., Heinz-Ulrich, et al. (1995) in Meyers, supra, pp. 965-968.) Examples of genetic map
data can be found in various scientific journals or at the Online Mendelian Inheritance in Man (OMIM)
World Wide Web site. Correlation between the location of the gene encoding PPIM on a physical map
and a specific disorder, or a predisposition to a specific disorder, may help define the region of DNA
associated with that disorder and thus may further positional cloning efforts.

In situ hybridization of chromosomal preparations and physical mapping techniques, such as
linkage analysis using established chromosomal markers, may be used for extending genetic maps. Often
the placement of a gene on the chromosome of another mammalian species, such as mouse, may reveal
associated markers even if the exact chromosomal locus is not known. This information is valuable to
investigators searching for disease genes using positional cloning or other gene discovery techniques.
Once the gene or genes responsible for a disease or syndrome have been crudely localized by genetic
linkage to a particular genomic region, e.g., ataxia-telangiectasia to 11q22-23, any sequences mapping to
that area may represent associated or regulatory genes for further investigation. (See, e.g., Gatti, R.A. et
al. (1988) Nature 336:577-580.) The nucleotide sequence of the instant invention may also be used to
detect differences in the chromosomal location due to translocation, inversion, etc., among normal,
carrier, or affected individuals.

In another embodiment of the invention, PPIM, its catalytic or immunogenic fragments, or
oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug
screening techniques. The fragment employed in such screening may be free in solution, affixed to a
solid support, borne on a cell surface, or located intracellularly. The formation of binding complexes
between PPIM and the agent being tested may be measured.

Another technique for drug screening provides for high throughput screening of compounds
having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et al. (1984) PCT
application WO84/03564.) In this method, large numbers of different small test compounds are
synthesized on a solid substrate. The test compounds are reacted with PPIM, or fragments thereof, and
washed. Bound PPIM is then detected by methods well known in the art. Purified PPIM can also be
coated directly onto plates for use in the aforementioned drug screening techniques. Alternatively,
non-neutralizing antibodies can be used to capture the peptide and immobilize it on a solid support.

In another embodiment, one may use competitive drug screening assays in which neutralizing 
antibodies capable of binding PPIM specifically compete with a test compound for binding PPIM. In 
this manner, antibodies can be used to detect the presence of any peptide which shares one or more 
antigenic determinants with PPIM. 

In additional embodiments, the nucleotide sequences which encode PPIM may be used in any
molecular biology techniques that have yet to be developed, provided the new techniques rely on
properties of nucleotide sequences that are currently known, including, but not limited to, such properties
as the triplet genetic code and specific base pair interactions.

Without further elaboration, it is believed that one skilled in the art can, using the preceding
description, utilize the present invention to its fullest extent. The following preferred specific
embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder
of the disclosure in any way whatsoever.

The disclosures of all patents, applications and publications, mentioned above and below, in
particular U.S. Ser. No. 60/147,986 and U.S. Ser. No. 60/160,807, are hereby expressly incorporated by
reference.

EXAMPLES

I. Construction of cDNA Libraries 

RNA was purchased from Clontech or isolated from tissues described in Table 4. Some tissues 
were homogenized and lysed in guanidinium isothiocyanate, while others were homogenized and lysed in 
phenol or in a suitable mixture of denaturants, such as TRIZOL (Life Technologies), a monophasic
solution of phenol and guanidine isothiocyanate. The resulting lysates were centrifuged over CsCl 
cushions or extracted with chloroform. RNA was precipitated from the lysates with either isopropanol 
or sodium acetate and ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA purity. 
In some cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was isolated using
oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QIAGEN, 
Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was 
isolated directly from tissue lysates using other RNA isolation kits, e.g., the POLY(A)PURE mRNA 
purification kit (Ambion, Austin TX). 
In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA
libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP
vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the
recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 1997, supra, units
5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic
oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with the
appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300-1000
bp) using SEPHACRYL S1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column chromatography
(Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs were ligated into
compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g., PBLUESCRIPT plasmid
(Stratagene), PSPORT1 plasmid (Life Technologies), pcDNA2.1 plasmid (Invitrogen, Carlsbad CA), or
pINCY plasmid (Incyte Genomics, Palo Alto CA). Recombinant plasmids were transformed into
competent E. coli cells including XL1-Blue, XL1-BlueMRF, or SOLR from Stratagene or DH5α,
DH10B, or ElectroMAX DH10B from Life Technologies.
II. Isolation of cDNA Clones 
Plasmids obtained as described in Example I were recovered from host cells by in vivo excision
using the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids were purified using at least one
of the following: a Magic or WIZARD Minipreps DNA purification system (Promega); an AGTC
Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, QIAWELL
8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the R.E.A.L. PREP 96 plasmid
purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0.1 ml of distilled
water and stored, with or without lyophilization, at 4°C.

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a
high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in
384-well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically using
PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence scanner
(Labsystems Oy, Helsinki, Finland).
III. Sequencing and Analysis 

Incyte cDNAs recovered in plasmids as described in Example II were sequenced as follows.
Sequencing reactions were processed using standard methods or high-throughput instrumentation such
as the ABI CATALYST 800 (PE Biosystems) thermal cycler or the PTC-200 thermal cycler (MJ
Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the MICROLAB
2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared using reagents
provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such as the ABI PRISM
BIGDYE Terminator cycle sequencing ready reaction kit (PE Biosystems). Electrophoretic separation of
cDNA sequencing reactions and detection of labeled polynucleotides were carried out using the
MEGABACE 1000 DNA sequencing system (Molecular Dynamics); the ABI PRISM 373 or 377
sequencing system (PE Biosystems) in conjunction with standard ABI protocols and base calling
software; or other sequence analysis systems known in the art. Reading frames within the cDNA
sequences were identified using standard methods (reviewed in Ausubel, 1997, supra, unit 7.7). Some of
the cDNA sequences were selected for extension using the techniques disclosed in Example VI.

The polynucleotide sequences derived from cDNA sequencing were assembled and analyzed
using a combination of software programs which utilize algorithms well known to those skilled in the art.
Table 5 summarizes the tools, programs, and algorithms used and provides applicable descriptions,
references, and threshold parameters. The first column of Table 5 shows the tools, programs, and
algorithms used, the second column provides brief descriptions thereof, the third column presents
appropriate references, all of which are incorporated by reference herein in their entirety, and the fourth
column presents, where applicable, the scores, probability values, and other parameters used to evaluate
the strength of a match between two sequences (the higher the score, the greater the homology between
two sequences). Sequences were analyzed using MACDNASIS PRO software (Hitachi Software
Engineering, South San Francisco CA) and LASERGENE software (DNASTAR). Polynucleotide and
polypeptide sequence alignments were generated using the default parameters specified by the clustal
algorithm as incorporated into the MEGALIGN multisequence alignment program (DNASTAR), which
also calculates the percent identity between aligned sequences.

The polynucleotide sequences were validated by removing vector, linker, and polyA sequences
and by masking ambiguous bases, using algorithms and programs based on BLAST, dynamic
programming, and dinucleotide nearest neighbor analysis. The sequences were then queried against a
selection of public databases such as the GenBank primate, rodent, mammalian, vertebrate, and
eukaryote databases, and BLOCKS, PRINTS, DOMO, PRODOM, and PFAM to acquire annotation
using programs based on BLAST, FASTA, and BLIMPS. The sequences were assembled into full
length polynucleotide sequences using programs based on Phred, Phrap, and Consed, and were screened
for open reading frames using programs based on GeneMark, BLAST, and FASTA. The full length
polynucleotide sequences were translated to derive the corresponding full length amino acid sequences,
and these full length sequences were subsequently analyzed by querying against databases such as the
GenBank databases (described above), SwissProt, BLOCKS, PRINTS, DOMO, PRODOM, Prosite,
and Hidden Markov Model (HMM)-based protein family databases such as PFAM. HMM is a
probabilistic approach which analyzes consensus primary structures of gene families. (See, e.g., Eddy,
S.R. (1996) Curr. Opin. Struct. Biol. 6:361-365.)

The programs described above for the assembly and analysis of full length polynucleotide and
amino acid sequences were also used to identify polynucleotide sequence fragments from SEQ ID
NO:28-54. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and
amplification technologies were described in The Invention section above.
IV. Analysis of Polynucleotide Expression 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a gene
and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs from a
particular cell type or tissue have been bound. (See, e.g., Sambrook, supra, ch. 7; Ausubel, 1995, supra,
ch. 4 and 16.)

Analogous computer techniques applying BLAST were used to search for identical or related
molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is much
faster than multiple membrane-based hybridizations. In addition, the sensitivity of the computer search
can be modified to determine whether any particular match is categorized as exact or similar. The basis
of the search is the product score, which is defined as:

$$\frac{\text{BLAST Score} \times \text{Percent Identity}}{5 \times \min\{\text{length(Seq. 1)},\ \text{length(Seq. 2)}\}}$$

The product score takes into account both the degree of similarity between two sequences and the length
of the sequence match. The product score is a normalized value between 0 and 100, and is calculated as
follows: the BLAST score is multiplied by the percent nucleotide identity and the product is divided by
(5 times the length of the shorter of the two sequences). The BLAST score is calculated by assigning a
score of +5 for every base that matches in a high-scoring segment pair (HSP), and -4 for every
mismatch. Two sequences may share more than one HSP (separated by gaps). If there is more than one
HSP, then the pair with the highest BLAST score is used to calculate the product score. The product
score represents a balance between fractional overlap and quality in a BLAST alignment. For example,
a product score of 100 is produced only for 100% identity over the entire length of the shorter of the two
sequences being compared. A product score of 70 is produced either by 100% identity and 70% overlap
at one end, or by 88% identity and 100% overlap at the other. A product score of 50 is produced either
by 100% identity and 50% overlap at one end, or 79% identity and 100% overlap.
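The product score calculation can be checked numerically with a short Python sketch. The function names are chosen for illustration, and the example assumes a single ungapped HSP and a shorter sequence of 100 bases; the +5/-4 scoring and the formula itself are taken from the description above.

```python
def blast_score(matches, mismatches):
    """Score an HSP as described above: +5 per match, -4 per mismatch."""
    return 5 * matches - 4 * mismatches

def product_score(score, percent_identity, len1, len2):
    """(BLAST score x percent identity) / (5 x length of the shorter sequence)."""
    return score * percent_identity / (5 * min(len1, len2))

# Worked examples for a shorter sequence of 100 bases (lengths are hypothetical)
full = product_score(blast_score(100, 0), 100, 100, 250)    # identical over full length
partial = product_score(blast_score(70, 0), 100, 100, 250)  # identical over 70% overlap
id88 = product_score(blast_score(88, 12), 88, 100, 250)     # 88% identity, full overlap
id79 = product_score(blast_score(79, 21), 79, 100, 250)     # 79% identity, full overlap
print(full, partial, round(id88, 3), round(id79, 3))
```

Under these assumptions the two full-overlap cases evaluate to roughly 69 and 49, which agrees with the rounded values of 70 and 50 given in the text.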

The results of northern analyses are reported as a percentage distribution of libraries in which
the transcript encoding PPIM occurred. Analysis involved the categorization of cDNA libraries by
organ/tissue and disease. The organ/tissue categories included cardiovascular, dermatologic,
developmental, endocrine, gastrointestinal, hematopoietic/immune, musculoskeletal, nervous,
reproductive, and urologic. The disease/condition categories included cancer, inflammation, trauma, cell
proliferation, neurological, and pooled. For each category, the number of libraries expressing the
sequence of interest was counted and divided by the total number of libraries across all categories.
Percentage values of tissue-specific and disease- or condition-specific expression are reported in Table 3.
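The percentage distribution can be sketched in Python as follows. The sketch takes the description literally, dividing the count of expressing libraries in each category by the total number of libraries in all categories; the data structure, function name, and library counts are hypothetical.

```python
def category_percentages(libraries):
    """libraries: list of (category, expresses_sequence) pairs, one per library.

    Returns {category: percentage}, where each percentage is the number of
    expressing libraries in that category divided by the total number of
    libraries across all categories.
    """
    total = len(libraries)
    if total == 0:
        return {}
    counts = {}
    for category, expressed in libraries:
        counts.setdefault(category, 0)
        if expressed:
            counts[category] += 1
    return {c: 100.0 * n / total for c, n in counts.items()}

# Hypothetical set of ten libraries
libraries = (
    [("nervous", True)] * 3 + [("nervous", False)]
    + [("reproductive", True)] + [("reproductive", False)] * 3
    + [("cardiovascular", False)] * 2
)
print(category_percentages(libraries))
```

If the denominator were instead meant to be only those libraries that express the sequence, the single line computing `total` would change; the text as written does not settle this point.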
V. Chromosomal Mapping of PPIM Encoding Polynucleotides 

The cDNA sequences which were used to assemble SEQ ID NO:28-54 were compared with
sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other
implementations of the Smith-Waterman algorithm. Sequences from these databases that matched SEQ
ID NO:28-54 were assembled into clusters of contiguous and overlapping sequences using assembly
algorithms such as Phrap (Table 5). Radiation hybrid and genetic mapping data available from public
resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for Genome
Research (WIGR), and Généthon were used to determine if any of the clustered sequences had been
previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment of all
sequences of that cluster, including its particular SEQ ID NO:, to that map location.
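The propagation of a map location from one mapped sequence to the rest of its cluster can be sketched as a simple lookup. In the Python example below, the cluster and sequence identifiers and the map location string are hypothetical, and the handling of clusters with conflicting locations is an assumption, since the text does not address that case.

```python
def assign_map_locations(clusters, mapped):
    """clusters: {cluster_id: [sequence_id, ...]}
    mapped:   {sequence_id: map_location} for previously mapped sequences.

    Returns {sequence_id: map_location} in which every member of a cluster
    inherits the location of a mapped member of that cluster.
    """
    assigned = {}
    for members in clusters.values():
        locations = {mapped[s] for s in members if s in mapped}
        if len(locations) == 1:
            (location,) = locations
            for s in members:
                assigned[s] = location
        # Clusters with no mapped member, or with conflicting locations,
        # are left unassigned in this sketch (an assumption).
    return assigned

# Hypothetical clusters and one previously mapped marker
clusters = {
    "cluster_1": ["seq_a", "seq_b", "marker_x"],
    "cluster_2": ["seq_c", "seq_d"],
}
mapped = {"marker_x": "chromosome 11, 110.2-117.9 cM"}
print(assign_map_locations(clusters, mapped))
```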

The genetic map locations of SEQ ID NO:30, SEQ ID NO:37, and SEQ ID NO:47 are
described in The Invention as ranges, or intervals, of human chromosomes. The map position of an
interval, in centiMorgans, is measured relative to the terminus of the chromosome's p-arm. (The
centiMorgan (cM) is a unit of measurement based on recombination frequencies between chromosomal
markers. On average, 1 cM is roughly equivalent to 1 megabase (Mb) of DNA in humans, although this
can vary widely due to hot and cold spots of recombination.) The cM distances are based on genetic
markers mapped by Généthon which provide boundaries for radiation hybrid markers whose sequences
were included in each of the clusters. A disease associated with the public and Incyte sequences located
within the indicated interval is also reported in The Invention.
VI. Extension of PPIM Encoding Polynucleotides

The full length nucleic acid sequences of SEQ ID NO:28-54 were produced by extension of an
appropriate fragment of the full length molecule using oligonucleotide primers designed from this
fragment. One primer was synthesized to initiate 5' extension of the known fragment, and the other
primer, to initiate 3' extension of the known fragment. The initial primers were designed using OLIGO
4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 nucleotides
in length, to have a GC content of about 50% or more, and to anneal to the target sequence at
temperatures of about 68°C to about 72°C. Any stretch of nucleotides which would result in hairpin
structures and primer-primer dimerizations was avoided.
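The length and GC-content targets stated above lend themselves to a simple screen, sketched here in Python. This is not the OLIGO 4.06 method; the function names and the candidate sequence are hypothetical, the "about" in the stated ranges is treated as a hard cutoff for simplicity, and annealing temperature, hairpins, and primer dimers are not evaluated.

```python
def gc_fraction(primer: str) -> float:
    """Fraction of G and C bases in a primer sequence."""
    seq = primer.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def meets_design_criteria(
    primer: str,
    min_len: int = 22,
    max_len: int = 30,
    min_gc: float = 0.50,
) -> bool:
    """Check only the length and GC-content targets given in the text."""
    return min_len <= len(primer) <= max_len and gc_fraction(primer) >= min_gc


# Hypothetical 24-mer used only to exercise the check.
candidate = "GCGTACCTGACGGATCCAGTCAGC"
candidate_ok = meets_design_criteria(candidate)
```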

Selected human cDNA libraries were used to extend the sequence. If more than one extension
was necessary or desired, additional or nested sets of primers were designed.

High fidelity amplification was obtained by PCR using methods well known in the art. PCR was 
performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction mix 
contained DNA template, 200 nmol of each primer, reaction buffer containing Mg2+, (NH4)2SO4, and
β-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme (Life
Technologies), and Pfu DNA polymerase (Stratagene), with the following parameters for primer pair
PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68°C, 2
min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. In the
alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1: 94°C, 3 min; Step 2:
94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times;
Step 6: 68°C, 5 min; Step 7: storage at 4°C.
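The two cycling programs above differ only in annealing temperature, which can be made explicit by writing them as data. The Python sketch below is illustrative; the class and function names are hypothetical. Because "repeated 20 times" could be read as 20 or 21 total passes through Steps 2 to 4, the number of passes is left as a parameter and set to 20 here as an assumption.

```python
from dataclasses import dataclass
from typing import List, Tuple

Step = Tuple[float, int]  # (temperature in degrees C, hold time in seconds)


@dataclass
class CyclingProgram:
    initial: Step        # Step 1
    cycle: List[Step]    # Steps 2, 3, and 4
    passes: int          # total passes through the cycle (assumed)
    final: Step          # Step 6

    def hold_seconds(self) -> int:
        """Programmed hold time, ignoring ramping and the 4 C storage step."""
        per_pass = sum(seconds for _, seconds in self.cycle)
        return self.initial[1] + self.passes * per_pass + self.final[1]


def extension_program(anneal_c: float, passes: int = 20) -> CyclingProgram:
    """Build the extension program for a given annealing temperature."""
    return CyclingProgram(
        initial=(94.0, 180),
        cycle=[(94.0, 15), (anneal_c, 60), (68.0, 120)],
        passes=passes,
        final=(68.0, 300),
    )


pci_program = extension_program(anneal_c=60.0)    # primer pair PCI A and PCI B
t7_sk_program = extension_program(anneal_c=57.0)  # primer pair T7 and SK+
total_minutes = pci_program.hold_seconds() / 60
```

Under the 20-pass assumption the programmed hold time is 73 minutes for either primer pair; actual run time would be longer because of temperature ramping.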

The concentration of DNA in each well was determined by dispensing 100 µl PICOGREEN
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in 1X TE
and 0.5 µl of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar,
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan II
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the
concentration of DNA. A 5 µl to 10 µl aliquot of the reaction mixture was analyzed by electrophoresis
on a 1% agarose mini-gel to determine which reactions were successful in extending the sequence.

The extended nucleotides were desalted and concentrated, transferred to 384-well plates, digested
with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and sonicated or
sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For shotgun sequencing,
the digested nucleotides were separated on low concentration (0.6 to 0.8%) agarose gels, fragments were
excised, and agar digested with Agar ACE (Promega). Extended clones were religated using T4 ligase
(New England Biolabs, Beverly MA) into pUC 18 vector (Amersham Pharmacia Biotech), treated with
Pfu DNA polymerase (Stratagene) to fill-in restriction site overhangs, and transfected into competent E.
coli cells. Transformed cells were selected on antibiotic-containing media, and individual colonies were
picked and cultured overnight at 37°C in 384-well plates in LB/2x carb liquid media.

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase (Amersham
Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following parameters: Step 1: 94°C,
3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; Step 5: steps 2, 3, and 4
repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was quantified by PICOGREEN
reagent (Molecular Probes) as described above. Samples with low DNA recoveries were reamplified
using the same conditions as described above. Samples were diluted with 20% dimethyl sulfoxide (1:2,
v/v), and sequenced using DYENAMIC energy transfer sequencing primers and the DYENAMIC
DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM BIGDYE Terminator cycle sequencing
ready reaction kit (PE Biosystems).

In like manner, the polynucleotide sequences of SEQ ID NO:28-54 are used to obtain 5' 
regulatory sequences using the procedure above, along with oligonucleotides designed for such extension, 
and an appropriate genomic library. 

VII. Labeling and Use of Individual Hybridization Probes

Hybridization probes derived from SEQ ID NO:28-54 are employed to screen cDNAs, genomic 
DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base pairs, is 
specifically described, essentially the same procedure is used with larger nucleotide fragments. 
Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 software (National
Biosciences) and labeled by combining 50 pmol of each oligomer, 250 µCi of [γ-32P] adenosine
triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase (DuPont NEN, Boston MA).
The labeled oligonucleotides are substantially purified using a SEPHADEX G-25 superfine size
exclusion dextran bead column (Amersham Pharmacia Biotech). An aliquot containing 10^7 counts per
minute of the labeled probe is used in a typical membrane-based hybridization analysis of human
genomic DNA digested with one of the following endonucleases: Ase I, Bgl II, Eco RI, Pst I, Xba I, or
Pvu II (DuPont NEN).

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 
membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 hours 
at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature under
conditions of up to, for example, 0.1x saline sodium citrate and 0.5% sodium dodecyl sulfate.
Hybridization patterns are visualized using autoradiography or an alternative imaging means and 
compared. 

VIII. Microarrays 

The linkage or synthesis of array elements upon a microarray can be achieved utilizing
photolithography, piezoelectric printing (ink-jet printing; see, e.g., Baldeschweiler, supra), mechanical
microspotting technologies, and derivatives thereof. The substrate in each of the aforementioned
technologies should be uniform and solid with a non-porous surface (Schena (1999), supra). Suggested
substrates include silicon, silica, glass slides, glass chips, and silicon wafers. Alternatively, a procedure
analogous to a dot or slot blot may also be used to arrange and link elements to the surface of a substrate
using thermal, UV, chemical, or mechanical bonding procedures. A typical array may be produced using
available methods and machines well known to those of ordinary skill in the art and may contain any
appropriate number of elements. (See, e.g., Schena, M. et al. (1995) Science 270:467-470; Shalon, D. et
al. (1996) Genome Res. 6:639-645; Marshall, A. and J. Hodgson (1998) Nat. Biotechnol. 16:27-31.)
Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof may
comprise the elements of the microarray. Fragments or oligomers suitable for hybridization can be
selected using software well known in the art such as LASERGENE software (DNASTAR). The array
elements are hybridized with polynucleotides in a biological sample. The polynucleotides in the
biological sample are conjugated to a fluorescent label or other molecular tag for ease of detection. After
hybridization, nonhybridized nucleotides from the biological sample are removed, and a fluorescence
scanner is used to detect hybridization at each array element. Alternatively, laser desorption and mass
spectrometry may be used for detection of hybridization. The degree of complementarity and the relative 
abundance of each polynucleotide which hybridizes to an element on the microarray may be assessed. In 
one embodiment, microarray preparation and usage is described in detail below. 
Tissue or Cell Sample Preparation 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and
poly(A)+ RNA is purified using the oligo-(dT) cellulose method. Each poly(A)+ RNA sample is
reverse transcribed using MMLV reverse-transcriptase, 0.05 pg/µl oligo-(dT) primer (21mer), 1X first
strand buffer, 0.03 units/µl RNase inhibitor, 500 µM dATP, 500 µM dGTP, 500 µM dTTP, 40 µM
dCTP, 40 µM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse
transcription reaction is performed in a 25 µl volume containing 200 ng poly(A)+ RNA with
GEMBRIGHT kits (Incyte). Specific control poly(A)+ RNAs are synthesized by in vitro transcription
from non-coding yeast genomic DNA. After incubation at 37°C for 2 hr, each reaction sample (one
with Cy3 and another with Cy5 labeling) is treated with 2.5 µl of 0.5M sodium hydroxide and
incubated for 20 minutes at 85°C to stop the reaction and degrade the RNA. Samples are purified
using two successive CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc.
(CLONTECH), Palo Alto CA) and after combining, both reaction samples are ethanol precipitated
using 1 µl of glycogen (1 mg/ml), 60 µl sodium acetate, and 300 µl of 100% ethanol. The sample is
then dried to completion using a SpeedVAC (Savant Instruments Inc., Holbrook NY) and resuspended
in 14 µl 5X SSC/0.2% SDS.

Microarray Preparation

Sequences of the present invention are used to generate array elements. Each array element is
amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification uses
primers complementary to the vector sequences flanking the cDNA insert. Array elements are
amplified in thirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater than 5 µg.
Amplified array elements are then purified using SEPHACRYL-400 (Amersham Pharmacia Biotech).
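As a worked illustration of the quantities stated above, the average per-cycle multiplication needed to go from 1-2 ng to 5 µg in thirty cycles can be computed directly. The Python sketch below is arithmetic only; the function name is hypothetical, and 5 µg is used as the final quantity although the text gives it as a lower bound.

```python
def mean_cycle_factor(start_ng: float, final_ng: float, cycles: int) -> float:
    """Average per-cycle multiplication needed to reach final_ng from start_ng."""
    return (final_ng / start_ng) ** (1.0 / cycles)


final_ng = 5000.0  # 5 ug expressed in ng (stated lower bound on the yield)
low = mean_cycle_factor(2.0, final_ng, 30)   # starting from 2 ng
high = mean_cycle_factor(1.0, final_ng, 30)  # starting from 1 ng
```

Both values come out at roughly 1.3 per cycle, well below the factor of 2 that ideal doubling would give, which suggests that the stated yield does not require exponential amplification to be sustained through all thirty cycles.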

Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 
slides (Corning) are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water 
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR Scientific 
Products Corporation (VWR), West Chester PA), washed extensively in distilled water, and coated 
with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 110°C oven.

Array elements are applied to the coated glass substrate using a procedure described in US 
Patent No. 5,807,522, incorporated herein by reference. 1 µl of the array element DNA, at an average
concentration of 100 ng/µl, is loaded into the open capillary printing element by a high-speed robotic
apparatus. The apparatus then deposits about 5 nl of array element sample per slide.
Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene).

Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60 °C followed by washes in 0.2% 
SDS and distilled water as before. 
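The printing quantities given above (a 1 µl load at an average of 100 ng/µl, with about 5 nl deposited per slide) imply a DNA mass per deposit and an upper limit on deposits per load. The Python sketch below works through that arithmetic; the variable names are hypothetical and dead volume in the capillary is ignored as a simplifying assumption.

```python
load_volume_nl = 1000.0          # 1 ul loaded into the capillary
concentration_ng_per_ul = 100.0  # average array element concentration
deposit_nl = 5.0                 # about 5 nl deposited per slide

# Mass of DNA in one deposit: concentration times deposited volume (in ul).
dna_per_deposit_ng = concentration_ng_per_ul * deposit_nl / 1000.0

# Upper bound on deposits from a single load, ignoring dead volume.
max_deposits_per_load = int(load_volume_nl // deposit_nl)
```

On these figures each deposit carries about 0.5 ng of DNA, and one load could in principle supply up to 200 slides.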
Hybridization

Hybridization reactions contain 9 µl of sample mixture consisting of 0.2 µg each of Cy3 and
Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample
mixture is heated to 65°C for 5 minutes and is aliquoted onto the microarray surface and covered with
a 1.8 cm² coverslip. The arrays are transferred to a waterproof chamber having a cavity just slightly
larger than a microscope slide. The chamber is kept at 100% humidity internally by the addition of
140 µl of 5X SSC in a corner of the chamber. The chamber containing the arrays is incubated for
about 6.5 hours at 60°C. The arrays are washed for 10 min at 45°C in a first wash buffer (1X SSC,
0.1% SDS), three times for 10 minutes each at 45°C in a second wash buffer (0.1X SSC), and dried.
Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped with an
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines
at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is
focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide
containing the array is placed on a computer-controlled X-Y stage on the microscope and
raster-scanned past the objective. The 1.8 cm x 1.8 cm array used in the present example is scanned with a
resolution of 20 micrometers.
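The scan geometry stated above fixes the size of the resulting image. As a short illustrative calculation in Python, assuming that one pixel is recorded per 20 micrometer step in each direction (an assumption, since the text gives only the resolution):

```python
array_side_cm = 1.8
resolution_um = 20.0

# 1 cm = 10,000 micrometers; round() guards against floating-point residue.
pixels_per_side = round(array_side_cm * 10_000 / resolution_um)
total_pixels = pixels_per_side ** 2
```

Under that assumption each scan yields a 900 by 900 image, or 810,000 intensity values per fluorophore.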

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 
Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 
Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. Appropriate 
filters positioned between the array and the photomultiplier tubes are used to filter the signals. The
emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is
typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source,
although the apparatus is capable of recording the spectra from both fluorophores simultaneously. 
The sensitivity of the scans is typically calibrated using the signal intensity generated by a
cDNA control species added to the sample mixture at a known concentration. A specific location on
the array contains a complementary DNA sequence, allowing the intensity of the signal at that location
to be correlated with a weight ratio of hybridizing species of 1:100,000. When two samples from
different sources (e.g., representing test and control cells), each labeled with a different fluorophore,
are hybridized to a single array for the purpose of identifying genes that are differentially expressed,
the calibration is done by labeling samples of the calibrating cDNA with the two fluorophores and
adding identical amounts of each to the hybridization mixture.

The output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to-digital 
(A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC 
computer. The digitized data are displayed as an image where the signal intensity is mapped using a
linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high
signal). The data are also analyzed quantitatively. Where two different fluorophores are excited and
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping emission
spectra) between the fluorophores using each fluorophore's emission spectrum.
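Two of the operations described above can be sketched in Python for illustration: the linear mapping of a 12-bit intensity onto a 20-step pseudocolor scale, and a two-channel crosstalk correction. The crosstalk model is an assumption, not the described implementation: each detector is taken to record its own fluorophore plus a fixed fraction of the other, and the fractions used in the example are hypothetical values that would in practice be derived from the emission spectra.

```python
from typing import Tuple

ADC_LEVELS = 4096  # 12-bit A/D conversion
COLOR_BINS = 20    # linear 20-color pseudocolor scale


def pseudocolor_bin(adc_value: int) -> int:
    """Map a 12-bit intensity to a bin from 0 (blue, low) to 19 (red, high)."""
    if not 0 <= adc_value < ADC_LEVELS:
        raise ValueError("expected a 12-bit value")
    return adc_value * COLOR_BINS // ADC_LEVELS


def correct_crosstalk(
    m_cy3: float, m_cy5: float, cy5_in_cy3: float, cy3_in_cy5: float
) -> Tuple[float, float]:
    """Invert the assumed model
        m_cy3 = s_cy3 + cy5_in_cy3 * s_cy5
        m_cy5 = cy3_in_cy5 * s_cy3 + s_cy5
    to recover the true signals (s_cy3, s_cy5)."""
    det = 1.0 - cy5_in_cy3 * cy3_in_cy5
    s_cy3 = (m_cy3 - cy5_in_cy3 * m_cy5) / det
    s_cy5 = (m_cy5 - cy3_in_cy5 * m_cy3) / det
    return s_cy3, s_cy5


# Hypothetical measured values and bleed-through fractions.
s_cy3, s_cy5 = correct_crosstalk(1010.0, 600.0, cy5_in_cy3=0.02, cy3_in_cy5=0.10)
```

With these example inputs the corrected signals are 1000 for Cy3 and 500 for Cy5, which are the values that generate the measured pair under the assumed model.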

A grid is superimposed over the fluorescence signal image such that the signal from each spot
is centered in each element of the grid. The fluorescence signal within each element is then integrated
to obtain a numerical value corresponding to the average intensity of the signal. The software used for 
signal analysis is the GEMTOOLS gene expression analysis program (Incyte). 
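The grid integration step can be illustrated with a minimal Python sketch. This is not the GEMTOOLS program; it assumes a regular grid of square elements of a known pixel size aligned to the image origin, and the function names and the small test image are hypothetical.

```python
from typing import List

Image = List[List[float]]


def element_mean(image: Image, top: int, left: int, size: int) -> float:
    """Average pixel intensity inside one square grid element."""
    total = 0.0
    for row in image[top:top + size]:
        total += sum(row[left:left + size])
    return total / (size * size)


def grid_means(image: Image, size: int) -> List[List[float]]:
    """Average intensity for every grid element, in row-major order."""
    n_rows = len(image) // size
    n_cols = len(image[0]) // size
    return [
        [element_mean(image, r * size, c * size, size) for c in range(n_cols)]
        for r in range(n_rows)
    ]


# Hypothetical 4 x 4 image covered by a 2 x 2 grid of 2-pixel elements.
image = [
    [1.0, 1.0, 0.0, 0.0],
    [1.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 4.0, 8.0],
    [0.0, 0.0, 8.0, 4.0],
]
means = grid_means(image, size=2)
```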
IX. Complementary Polynucleotides 

Sequences complementary to the PPIM-encoding sequences, or any parts thereof, are used to
detect, decrease, or inhibit expression of naturally occurring PPIM. Although use of oligonucleotides
comprising from about 15 to 30 base pairs is described, essentially the same procedure is used with
smaller or with larger sequence fragments. Appropriate oligonucleotides are designed using OLIGO
4.06 software (National Biosciences) and the coding sequence of PPIM. To inhibit transcription, a
complementary oligonucleotide is designed from the most unique 5' sequence and used to prevent
promoter binding to the coding sequence. To inhibit translation, a complementary oligonucleotide is
designed to prevent ribosomal binding to the PPIM-encoding transcript.

X. Expression of PPIM 

Expression and purification of PPIM is achieved using bacterial or virus-based expression 
systems. For expression of PPIM in bacteria, cDNA is subcloned into an appropriate vector containing 
an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA transcription.
Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid promoter and the T5
or T7 bacteriophage promoter in conjunction with the lac operator regulatory element. Recombinant
vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3). Antibiotic resistant bacteria
express PPIM upon induction with isopropyl beta-D-thiogalactopyranoside (IPTG). Expression of
PPIM in eukaryotic cells is achieved by infecting insect or mammalian cell lines with recombinant
Autographa californica nuclear polyhedrosis virus (AcMNPV), commonly known as baculovirus. The
nonessential polyhedrin gene of baculovirus is replaced with cDNA encoding PPIM by either
homologous recombination or bacterial-mediated transposition involving transfer plasmid intermediates.
Viral infectivity is maintained and the strong polyhedrin promoter drives high levels of cDNA
transcription. Recombinant baculovirus is used to infect Spodoptera frugiperda (Sf9) insect cells in most
cases, or human hepatocytes, in some cases. Infection of the latter requires additional genetic
modifications to baculovirus. (See Engelhard, E.K. et al. (1994) Proc. Natl. Acad. Sci. USA
91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945.)

In most expression systems, PPIM is synthesized as a fusion protein with, e.g., glutathione
S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step,
affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26-kilodalton
enzyme from Schistosoma japonicum, enables the purification of fusion proteins on immobilized
glutathione under conditions that maintain protein activity and antigenicity (Amersham Pharmacia
Biotech). Following purification, the GST moiety can be proteolytically cleaved from PPIM at
specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity purification using
commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak). 6-His, a
stretch of six consecutive histidine residues, enables purification on metal-chelate resins (QIAGEN).
Methods for protein expression and purification are discussed in Ausubel (1995, supra, ch. 10 and 16).
Purified PPIM obtained by these methods can be used directly in the assays shown in Examples XI and
XV.

XI. Demonstration of PPIM Activity 

Protease activity of PPIM is measured by the hydrolysis of appropriate synthetic peptide 
substrates conjugated with various chromogenic molecules. The degree of hydrolysis is quantified by 
spectrophotometric (or fluorometric) absorption of the released chromophore (Beynon, R.J. and J.S.
Bond (1994) Proteolytic Enzymes: A Practical Approach, Oxford University Press, New York NY,
pp. 25-55). Peptide substrates are designed according to the category of protease activity as
endopeptidase (serine, cysteine, aspartic proteases), aminopeptidase (leucine aminopeptidase), or
carboxypeptidase (Carboxypeptidase A and B, procollagen C-proteinase). Chromogens commonly used
are 2-naphthylamine, 4-nitroaniline, and furylacrylic acid. Assays are performed at ambient temperature
using an aliquot of PPIM and the appropriate substrate in a suitable buffer. Reactions are carried out in 
an optical cuvette and followed by the measurement of increase/decrease in absorbance of the chromogen 
released during hydrolysis of the peptide substrate. The change in absorbance is proportional to PPIM 
activity in the assay. 
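Because the change in absorbance is stated to be proportional to activity, one simple way to reduce a time course to a single number is the slope of absorbance against time. The Python sketch below fits that slope by ordinary least squares; the readings are hypothetical, and the approach assumes the measurements fall within the linear (initial-rate) portion of the reaction.

```python
from typing import Sequence


def absorbance_rate(times_min: Sequence[float], absorbances: Sequence[float]) -> float:
    """Least-squares slope of absorbance versus time (absorbance units per minute)."""
    n = len(times_min)
    mean_t = sum(times_min) / n
    mean_a = sum(absorbances) / n
    sxy = sum((t - mean_t) * (a - mean_a) for t, a in zip(times_min, absorbances))
    sxx = sum((t - mean_t) ** 2 for t in times_min)
    return sxy / sxx


# Hypothetical readings taken once per minute.
times = [0.0, 1.0, 2.0, 3.0, 4.0]
readings = [0.100, 0.125, 0.150, 0.175, 0.200]
rate = absorbance_rate(times, readings)
```

A negative slope would correspond to the case in which absorbance decreases during hydrolysis; relative activities of two samples could then be compared as the ratio of the magnitudes of their slopes.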

XII. Functional Assays

PPIM function is assessed by expressing the sequences encoding PPIM at physiologically 
elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression 
vector containing a strong promoter that drives high levels of cDNA expression. Vectors of choice
include pCMV SPORT plasmid (Life Technologies) and pCR3.1 plasmid (Invitrogen), both of which
contain the cytomegalovirus promoter. 5-10 µg of recombinant vector are transiently transfected into a
human cell line, for example, an endothelial or hematopoietic cell line, using either liposome formulations
or electroporation. 1-2 µg of an additional plasmid containing sequences encoding a marker protein are
co-transfected. Expression of a marker protein provides a means to distinguish transfected cells from
nontransfected cells and is a reliable predictor of cDNA expression from the recombinant vector.
Marker proteins of choice include, e.g., Green Fluorescent Protein (GFP; Clontech), CD64, or a
CD64-GFP fusion protein. Flow cytometry (FCM), an automated, laser optics-based technique, is used to
identify transfected cells expressing GFP or CD64-GFP and to evaluate the apoptotic state of the cells
and other cellular properties. FCM detects and quantifies the uptake of fluorescent molecules that
diagnose events preceding or coincident with cell death. These events include changes in nuclear DNA
content as measured by staining of DNA with propidium iodide; changes in cell size and granularity as
measured by forward light scatter and 90 degree side light scatter; down-regulation of DNA synthesis as
measured by decrease in bromodeoxyuridine uptake; alterations in expression of cell surface and
intracellular proteins as measured by reactivity with specific antibodies; and alterations in plasma
membrane composition as measured by the binding of fluorescein-conjugated Annexin V protein to the
cell surface. Methods in flow cytometry are discussed in Ormerod, M.G. (1994) Flow Cytometry,
Oxford, New York NY.

The influence of PPIM on gene expression can be assessed using highly purified populations of
cells transfected with sequences encoding PPIM and either CD64 or CD64-GFP. CD64 and CD64-GFP
are expressed on the surface of transfected cells and bind to conserved regions of human immunoglobulin
G (IgG). Transfected cells are efficiently separated from nontransfected cells using magnetic beads
coated with either human IgG or antibody against CD64 (DYNAL, Lake Success NY). mRNA can be
purified from the cells using methods well known by those of skill in the art. Expression of mRNA 
encoding PPIM and other genes of interest can be analyzed by northern analysis or microarray 
techniques. 

XIII. Production of PPIM Specific Antibodies

PPIM substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 
Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to 
immunize rabbits and to produce antibodies using standard protocols. 

Alternatively, the PPIM amino acid sequence is analyzed using LASERGENE software
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions, are well
described in the art. (See, e.g., Ausubel, 1995, supra, ch. 11.)

Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431A
peptide synthesizer (PE Biosystems) using FMOC chemistry and coupled to KLH (Sigma-Aldrich, St.
Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to increase
immunogenicity. (See, e.g., Ausubel, 1995, supra.) Rabbits are immunized with the oligopeptide-KLH
complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide and anti-PPIM
activity by, for example, binding the peptide or PPIM to a substrate, blocking with 1% BSA, reacting
with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG.

XIV. Purification of Naturally Occurring PPIM Using Specific Antibodies 

Naturally occurring or recombinant PPIM is substantially purified by immunoaffinity
chromatography using antibodies specific for PPIM. An immunoaffinity column is constructed by
covalently coupling anti-PPIM antibody to an activated chromatographic resin, such as CNBr-activated
SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is blocked and washed
according to the manufacturer's instructions. 

Media containing PPIM are passed over the immunoaffinity column, and the column is washed 
under conditions that allow the preferential absorbance of PPIM (e.g., high ionic strength buffers in the 
presence of detergent). The column is eluted under conditions that disrupt antibody/PPIM binding (e.g.,
a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such as urea or thiocyanate ion), and
PPIM is collected. 

XV. Identification of Molecules Which Interact with PPIM 

PPIM, or biologically active fragments thereof, are labeled with 125I Bolton-Hunter reagent.
(See, e.g., Bolton A.E. and W.M. Hunter (1973) Biochem. J. 133:529-539.) Candidate molecules
previously arrayed in the wells of a multi-well plate are incubated with the labeled PPIM, washed, and
any wells with labeled PPIM complex are assayed. Data obtained using different concentrations of 
PPIM are used to calculate values for the number, affinity, and association of PPIM with the candidate 
molecules. 
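The text above does not name the analysis used to obtain the number and affinity of binding sites. One classical choice, shown here purely as an illustrative assumption, is a Scatchard analysis under a single-site binding model, in which bound/free plotted against bound is a straight line with slope -1/Kd and x-intercept Bmax. The Python sketch below fits that line to synthetic data; all names, values, and units are hypothetical.

```python
from typing import List, Sequence, Tuple


def linear_fit(xs: Sequence[float], ys: Sequence[float]) -> Tuple[float, float]:
    """Ordinary least-squares slope and intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def scatchard(bound: Sequence[float], free: Sequence[float]) -> Tuple[float, float]:
    """Return (Kd, Bmax) from the line bound/free = Bmax/Kd - bound/Kd."""
    ratios = [b / f for b, f in zip(bound, free)]
    slope, intercept = linear_fit(bound, ratios)
    kd = -1.0 / slope
    bmax = -intercept / slope
    return kd, bmax


# Synthetic single-site data generated with Kd = 2.0 and Bmax = 10.0.
free_conc: List[float] = [0.5, 1.0, 2.0, 4.0, 8.0]
bound_conc = [10.0 * f / (2.0 + f) for f in free_conc]
kd, bmax = scatchard(bound_conc, free_conc)
```

Because the synthetic data follow the single-site model exactly, the fit recovers the generating values; real binding data would show scatter, and curvature in the plot would indicate that the single-site assumption does not hold.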

Alternatively, molecules interacting with PPIM are analyzed using the yeast two-hybrid system
as described in Fields, S. and O. Song (1989, Nature 340:245-246), or using commercially available
kits based on the two-hybrid system, such as the MATCHMAKER system (Clontech). 

PPIM may also be used in the PATHCALLING process (CuraGen Corp., New Haven CT) 
which employs the yeast two-hybrid system in a high-throughput manner to determine all interactions 
between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S. Patent
No. 6,057,101). 



Various modifications and variations of the described methods and systems of the invention will 
be apparent to those skilled in the art without departing from the scope and spirit of the invention. 
Although the invention has been described in connection with certain embodiments, it should be
understood that the invention as claimed should not be unduly limited to such specific embodiments. 
Indeed, various modifications of the described modes for carrying out the invention which are obvious to 
those skilled in molecular biology or related fields are intended to be within the scope of the following 
claims. 
What is claimed is:

1. An isolated polypeptide comprising an amino acid sequence selected from the group
consisting of:

a) an amino acid sequence selected from the group consisting of SEQ ID NO:1, SEQ ID
NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8,
SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:15,
SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21,
SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, and SEQ ID
NO:27,

b) a naturally occurring amino acid sequence having at least 90% sequence identity to an
amino acid sequence selected from the group consisting of SEQ ID NO:1, SEQ ID NO:2, SEQ ID
NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9,
SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16,
SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22,
SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, and SEQ ID NO:27,

c) a biologically active fragment of an amino acid sequence selected from the group consisting
of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID
NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID
NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID
NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID
NO:26, and SEQ ID NO:27, and

d) an immunogenic fragment of an amino acid sequence selected from the group consisting of
SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID
NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID
NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID
NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID
NO:26, and SEQ ID NO:27.

2. An isolated polypeptide of claim 1 selected from the group consisting of SEQ ID NO:1,
SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID
NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:14, SEQ ID
NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID
NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, and SEQ
ID NO:27.

3. An isolated polynucleotide encoding a polypeptide of claim 1.

4. An isolated polynucleotide encoding a polypeptide of claim 2. 

5. An isolated polynucleotide of claim 4 selected from the group consisting of SEQ ID
NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID
NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID
NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID
NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID
NO:53, and SEQ ID NO:54.

6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 

7. A cell transformed with a recombinant polynucleotide of claim 6.

8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 

9. A method for producing a polypeptide of claim 1, the method comprising:

a) culturing a cell under conditions suitable for expression of the polypeptide, wherein said
cell is transformed with a recombinant polynucleotide, and said recombinant polynucleotide comprises
a promoter sequence operably linked to a polynucleotide encoding the polypeptide of claim 1, and

b) recovering the polypeptide so expressed.

10. An isolated antibody which specifically binds to a polypeptide of claim 1.

11. An isolated polynucleotide comprising a polynucleotide sequence selected from the group
consisting of:

a) a polynucleotide sequence selected from the group consisting of SEQ ID NO:28, SEQ ID
NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID
NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:41, SEQ ID
NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID
NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, and SEQ
ID NO:54,

b) a naturally occurring polynucleotide sequence having at least 70% sequence identity to a
polynucleotide sequence selected from the group consisting of SEQ ID NO:28, SEQ ID NO:29, SEQ
ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID
NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:41, SEQ ID NO:42, SEQ ID
NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID
NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, and SEQ ID NO:54,

c) a polynucleotide sequence complementary to a), 

d) a polynucleotide sequence complementary to b), and 

e) an RNA equivalent of a)-d). 

12. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynucleotide of claim 11. 

13. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 11, the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, and which probe 
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization 
complex is formed between said probe and said target polynucleotide or fragments thereof, and 

b) detecting the presence or absence of said hybridization complex, and, optionally, if present, 
the amount thereof. 

14. A method of claim 13, wherein the probe comprises at least 60 contiguous nucleotides. 

15. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 11, the method comprising: 

a) amplifying said target polynucleotide or fragment thereof using polymerase chain reaction 
amplification, and 

b) detecting the presence or absence of said amplified target polynucleotide or fragment 
thereof, and, optionally, if present, the amount thereof. 


16. A composition comprising an effective amount of a polypeptide of claim 1 and a 
pharmaceutically acceptable excipient. 

17. A composition of claim 16, wherein the polypeptide comprises an amino acid sequence 
selected from the group consisting of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, 
SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID 
NO:11, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID 
NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID 
NO:24, SEQ ID NO:25, SEQ ID NO:26, and SEQ ID NO:27. 


18. A method for treating a disease or condition associated with decreased expression of 
functional PPIM, comprising administering to a patient in need of such treatment the composition of 
claim 16. 

19. A method for screening a compound for effectiveness as an agonist of a polypeptide of 
claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting agonist activity in the sample. 

20. A composition comprising an agonist compound identified by a method of claim 19 and a 
pharmaceutically acceptable excipient. 

21. A method for treating a disease or condition associated with decreased expression of 
functional PPIM, comprising administering to a patient in need of such treatment a composition of 
claim 20. 

22. A method for screening a compound for effectiveness as an antagonist of a polypeptide of 
claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting antagonist activity in the sample. 

23. A composition comprising an antagonist compound identified by a method of claim 22 
and a pharmaceutically acceptable excipient. 

24. A method for treating a disease or condition associated with overexpression of functional 
PPIM, comprising administering to a patient in need of such treatment a composition of claim 23. 

25. A method of screening for a compound that specifically binds to the polypeptide of claim 
1, said method comprising the steps of: 

a) combining the polypeptide of claim 1 with at least one test compound under suitable 
conditions, and 

b) detecting binding of the polypeptide of claim 1 to the test compound, thereby identifying a 
compound that specifically binds to the polypeptide of claim 1. 

26. A method of screening for a compound that modulates the activity of the polypeptide of 
claim 1, said method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under conditions 
permissive for the activity of the polypeptide of claim 1, 

b) assessing the activity of the polypeptide of claim 1 in the presence of the test compound, 
and 

c) comparing the activity of the polypeptide of claim 1 in the presence of the test compound 
with the activity of the polypeptide of claim 1 in the absence of the test compound, wherein a change 
in the activity of the polypeptide of claim 1 in the presence of the test compound is indicative of a 
compound that modulates the activity of the polypeptide of claim 1. 


27. A method for screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the method 
comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, and 

b) detecting altered expression of the target polynucleotide. 

28. A method for assessing toxicity of a test compound, said method comprising: 

a) treating a biological sample containing nucleic acids with the test compound; 

b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at 
least 20 contiguous nucleotides of a polynucleotide of claim 11 under conditions whereby a specific 
hybridization complex is formed between said probe and a target polynucleotide in the biological 
sample, said target polynucleotide comprising a polynucleotide sequence of a polynucleotide of claim 
11 or fragment thereof; 

c) quantifying the amount of hybridization complex; and 

d) comparing the amount of hybridization complex in the treated biological sample with the 
amount of hybridization complex in an untreated biological sample, wherein a difference in the amount 
of hybridization complex in the treated biological sample is indicative of toxicity of the test compound. 
Leu Ser Arg Ala Tyr Asp Gly Thr Thr 
215 

Gly Leu Asn Asn Ile Lys Ala Asn Asp 
230 

Gln Ala Leu Ser Asn Val Pro Pro Leu 
245 

Glu Asp Asn Tyr Lys Asn Ile Lys Arg 
260 

Phe Leu Leu Val Gln Arg Phe Gly Glu 
275 

Asn Pro Arg Asn Phe Lys Ala His Val 
290 

Gln Ala Val Val Leu Cys Ser Lys Lys 
305 

Gln Gly Asp Gly Val Asp Phe Leu Ser 
320 

His Ser Ala Leu Gly Gly Thr Lys Lys 
335 

Thr Asp Val Phe Gln Gly Ser Met Arg 
350 

Pro His Pro Asp Leu Pro Ala Glu Glu 
365 

Asn Asp Glu Tyr Gln Glu Thr Met Val 
380 

Leu Thr Leu Asp Leu Pro Thr Ala Pro 
395 

Glu Gln Leu Ile Ile Pro Gln Val Pro 
410 

Lys Phe Asn Gly Ile Thr Glu Lys Glu 
425 

Asn Phe Leu Lys Arg Phe Gln Leu Thr 
440 

Ile Phe Cys Ile Lys Arg Phe Thr Lys 
455 

Lys Asn Pro Thr Ile Val Asn Phe Pro 
470 

Arg Glu Tyr Leu Ser Glu Glu Val Gln 
485 

Thr Tyr Asp Leu Ile Ala Asn Ile Val 
500 

Glu Gly Ser Tyr Arg Ile His Val Leu 
Trp Tyr Glu Leu Gln Asp Leu Gln Val 
Met Ile Thr Leu Ser Glu Ala Tyr Ile 
Asp Asn Asp Glu Thr Asn Gln Gln Gly 
<400> 10 

Met Ala Ala Ala Val Arg Gln Asp Leu 
1 5 

Ser Gly Ser His Lys Asp Leu Ala Gly 
20 

Glu Lys Ala Ile Gln Leu Ser Gly Ala 
35 

Lys Ala Phe Val Glu Ala Met Val Asn 
50 

Ile Ser Arg Gln Leu Leu Thr Asp Phe 
65 

Leu Pro Asp Ser Thr Ala Lys Glu Ile 
80 

Lys Ile Gln Pro Arg Val Ile Ser Phe 
95 

Ile Arg Gln His Leu Ala Ser Ile Tyr 
110 

Arg Asn Ala Ala Gln Val Leu Val Gly 
125 

Gln Lys Gln Tyr Asn Val Asp Tyr Lys 
140 

Ile Ala Arg Leu Tyr Leu Glu Asp Asp 
155 

Ala Tyr Ile Asn Arg Ala Ser Leu Leu 
170 

Glu Gln Leu Gln Ile His Tyr Lys Val 
185 

Asp Tyr Arg Arg Lys Phe Ile Glu Ala 
200 

Leu Ser Tyr Lys Thr Ile Val His Glu 
215 

Leu Lys His Ala Leu His Cys Thr Ile 
230 
85 90 

Glu Ala Met Gln Trp Val 
100 105 

Ser Ile Leu Gln Ala Leu 
115 120 

Glu Gly Leu Tyr Leu Leu 
130 135 

Ser Leu Val Leu Asn Glu 
145 150 

Val Lys Val His Thr Ile 
160 165 

Val Glu Phe Leu Arg Lys 
175 180 

His Cys Pro Val Gly Glu 
190 195 

Leu Thr Lys Gly Phe Ile 
205 210 

Phe Glu Gly Asp Asp Leu 
220 225 

Ala Arg Ser Phe Leu Trp 
235 240 

Gln Lys Lys Asn Asp Ala 
250 255 

Ala Gln Leu Met Asn Ser 
10 15 

Lys Tyr Arg Gln Ile Leu 
25 30 

Glu Gln Leu Glu Ala Leu 
40 45 

Glu Asn Val Ser Leu Val 
55 60 

Cys Thr His Leu Pro Asn 
70 75 

Tyr His Phe Thr Leu Glu 
85 90 

Glu Glu Gln Val Ala Ser 
100 105 

Glu Lys Glu Glu Asp Trp 
115 120 

Ile Pro Leu Glu Thr Gly 
130 135 

Leu Glu Thr Tyr Leu Lys 
145 150 

Asp Pro Val Gln Ala Glu 
160 165 

Gln Asn Glu Ser Thr Asn 
175 180 

Cys Tyr Ala Arg Val Leu 
190 195 

Ala Gln Arg Tyr Asn Glu 
205 210 

Ser Glu Arg Leu Glu Ala 
220 225 

Leu Ala Ser Ala Gly Gln 
235 240 
Gln Arg Ser Arg Met Leu Ala Thr 
245 

Gln Gln Leu Ala Ala Tyr Gly Ile 
260 

Arg Ile Ile Arg Gly Asn Gln Leu 
275 

Met Pro His Gln Lys Ala Thr Thr 
290 

Asp Arg Ala Val Ile Glu His Asn 
305 

Tyr Asn Asn Ile Thr Phe Glu Glu 
320 

Pro Ala Ala Lys Ala Glu Lys Ile 
335 

Gly Arg Met Asn Gly Phe Ile Asp 
350 

Phe Glu Thr Arg Glu Ala Leu Pro 
365 

Ser Leu Cys Phe Gln Val Asn Asn 
380 

Thr Ala Pro Glu Trp Thr Ala Gln 
395 

Gln 

<210> 11 

<211> 172 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3489039CD1 

<400> 11 



Met Leu Leu Pro Asn Ile Leu Leu 
1 5 

Lys Thr Thr Leu Gly Lys Glu Leu 
20 

Tyr Ile Asn Val Gly Asp Leu Ala 
35 

Gly Tyr Asp Glu Glu Tyr Asp Cys 
50 

Val Val Asp Glu Leu Asp Asn Gln 
65 

Val Asp Tyr His Gly Cys Asp Phe 
80 

Ile Val Phe Val Leu Arg Thr Asp 
95 

Leu Glu Thr Arg Gly Tyr Asn Glu 
110 

Gln Cys Glu Ile Phe Gln Val Leu 
125 

Tyr Lys Glu Glu Ile Val His Gln 
140 

Glu Leu Glu Asn Asn Val Asp Gln 
155 

Trp Ile Lys Asp His Asn Ser 
170 









<210> 12 

<211> 517 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 5432879CD1 



Leu Phe Lys Asp Glu Arg Cys 
250 255 

Leu Glu Lys Met Tyr Leu Asp 
265 270 

Gln Glu Phe Ala Ala Met Leu 
280 285 

Ala Asp Gly Ser Ser Ile Leu 
295 300 

Leu Leu Ser Ala Ser Lys Leu 
310 315 

Leu Gly Ala Leu Leu Glu Ile 
325 330 

Ala Ser Gln Met Ile Thr Glu 
340 345 

Gln Ile Asp Gly Ile Val His 
355 360 

Thr Trp Asp Lys Gln Ile Gln 
370 375 

Leu Leu Glu Lys Ile Ser Gln 
385 390 

Ala Met Glu Ala Gln Met Ala 
400 405 



Thr Gly Thr Pro Gly Val Gly 
10 15 

Ala Ser Lys Ser Gly Leu Lys 
25 30 

Arg Glu Glu Gln Leu Tyr Asp 
40 45 

Pro Ile Leu Asp Glu Asp Arg 
55 60 

Met Arg Glu Gly Gly Val Ile 
70 75 

Phe Pro Glu Arg Trp Phe His 
85 90 

Thr Asn Val Leu Tyr Glu Arg 
100 105 

Lys Lys Leu Thr Asp Asn Ile 
115 120 

Tyr Glu Glu Ala Thr Ala Ser 
130 135 

Leu Pro Ser Asn Lys Pro Glu 
145 150 

Ile Leu Lys Trp Ile Glu Gln 
160 165 
<400> 12 



Met 


Leu 


Ser 




Arg 


J-Cl 


Glu 


Ala 




Met 


Thr 


Ala 


Ala 


Asp 


Arg 


1 








c 

D 










1 0 

1 u 










15 


Ala 


lie 


Gin 


Arg 


filer 




Ar g 


Thr 


Gly 


Ala 


Ala 


Val 


Ara 


Tvr 


Lys 








20 




















3 0 


Val 


Met 


Lys 


Asn 


Trp 


b iy 




1 1 tr 


P 1 \r 

b 1 y 


uiy 


He 


Ala 


Ala 


Ala 


Leu 








J b 










a n 










45 


Ala 


Ala 


Gly 


lie 


Tyr 


Vdl 


lie 


Trp 


biy 




He 


Thr 


Glu 


Arg 


Lys 








c n 
b (J 




















60 


Lys 


Arg 


Arg 


Lys 




Leu 


v a 1 


Pro 


Gly 


Leu 


Val 


Asn 


Leu 


Glv 


Asn 








c c 
o b 




















7 5 


Thr 


Cys 


Phe 


Met 


Asn 


Ser 


Leu 


Leu 


Pin 
bin 


Civ 
uiy 


eu 


Ser 


Ala 




Pro 








o n 
o U 




















90 


Ala 


Phe 


lie 


Arg 


Trp 


Leu 


ulU 


p 1 1 > 


rule 


lill 


C ^ v- 

0 tr 1 


Gin 


iy l 


Ser 


Arg 








y b 










1 u u 










1 OR 

1 U J 


Asp 


Gin 


Lys 


Glu 


Pro 


Pro 




Ui p 

n 1 0 


Plr\ 
bill 


iyr 


Leu 


Ser 


Leu 


Thr 


Leu 






110 










1 1 tr 










1 9 n 

1 z u 


Leu 


His 


Leu 


Leu 


Lys 


Ala 


Leu 


Ser 


Cys 


r* 1 n 

bin 


pin 
ulU 


val 


111 X 


Asp 












TIC 

lz b 










1j u 










13 5 


Glu 


Val 


Leu 


Asp 


Ala 


Ser 


Cys 


Leu 


Leu 


Asp 


Val 




rg 


Met 


iyx 








14 0 










1 A 
14i _) 










1 J V 


Arg 


Trp 


Gin 


lie 


Ser 


Ser 


Phe 


Pin 
biU 


Pin 


p "I >-l 

bin 


Asp 


nla 


U i c 

nib 


pin 

blU 


Leu 






■ICC 

lbb 










1 £ n 
1 0 u 










1 O Zj 


Phe 


His 


Val 


lie 


Thr 


Ser 


Ser 


Lgu 


Plii 
blU 


Asp 


blU 


Arg 


Asp 


Arg 


Pin 
bill 










1 *7 A 
1 / U 










1 / j 










1 p. n 

1 0 \J 


Pro 


Arg 


Val 


Thr 


His 


Leu 


rue 


Asp 


Wa 1 
val 


nib 


Car 


Leu 


Pi 11 

b 1 LI 


PI n 

bill 


Gin 








185 










1 on 

i y u 










1 y -j 


Ser 


Glu 


I le 


Thr 


Pro 


Lys 


Gin 


lie 


Tnr 


Cys 


Arg 


1 111 


Arg 


Pi;; 

b iy 


Cor 
O cl 










200 










one. 










Z 1 U 


Pro 


His 


Pro 


Thr 


Ser 


Asn 


His 


Trp 


Lys 


Ser 


ply, 

bin 


ril S 


Pro 


rile 


nib 










215 










OTA 
ZZ U 










z z _> 


Gly 


Arg 


Leu 


Thr 


Ser 


Asn 


Met 


val 


Cys 


Lys 


His 


Cys 


piii 

blU 


O i c 

MIS 


Pin 

bin 






23 0 










ZJ b 










O A Pi 

Z4 u 


Ser 


Pro 


Val 


Arg 


Phe 


Asp 


Thr 


Phe 


Asp 


Ser 


Leu 


Ser 


Leu 


Ser 


Tl Q 

lie 








245 










OCA 










Z bb 


Pro 


Ala 


Ala 


Thr 


Trp 


Gly 


His 


Pro 


Leu 


Thr 


Leu 


Asp 


His 


Cys 


Leu 










2 60 










"5 £ C 










z / u 


His 


His 


Phe 


lie 


Ser 


Ser 


Glu 


Ser 


vai 


Arg 


Asp 


vai 


\J-, T 

va 1 


Cys 


Asp 










275 










T G A 

Z oU 










Z £5 _> 


Asn 


Cys 


Thr 


Lys 


lie 


Glu 


Ala. 


Lys 




Thr 


Leu 


Asn 


biy 


pin 
blU 


Lys 






290 










z y b 












Val 


Glu 


His 


Gin 


Arg 


Thr 


Thr 


rxie 


vai 


Lys 


bin 


Leu 


Lys 


Leu 


nil; 
b iy 










■j a C 

3 0b 






















Lys 


Leu 


Pro 


Gin 


Cys 


Leu 


Cys 


lie 


Ui e 

HIS 


Leu 


pin 

bin 


Arg 


Leu 


ser 


irp 








3 20 










J Z b 










t 1 n 


Ser 


Ser 


His 


Gly 


Thr 


Pro 


Leu 


Lys 


Arg 


ill s 


piii 

b 1 U 


nib 


val 


Pin 
bill 


Pino 










33 5 










1 a n 










7 / C 
O f± D 


Asn 


Glu 


Phe 


Leu 


Mec 


Wirt ¥■ 

Met 


Asp 


T 1 0 

lie 


Tyr 


Lys 


Tyr 


nib 


Leu 


Leu 


pi v 
oiy 










3 bu 










ire 










7 A fi 

J D U 


His 


Lys 


Pro 


Ser 


Gin 


His 


Asn 


Pro 


Lys 


Leu 


Asn 


Lys 


As n 


Pro 


Plv 
b iy 








t re 
ODD 










Tin 
J / u 










37 5 


Pro 


Thr 


Leu 


pin 


Lgu 


bin 


Asp 


P 1 w 


ro 


P 1 \/ 
uiy 


Ala 


Pro 


Thr 


Pro 


Val 










j SU 










.j 0 j 










3 90 


Leu 


Asn 


bin 


Pro 


biy 


Ala. 


Pro 


Lys 


1 ill 


pi ft 

bill 


T 1 Q 

iic 


riltr 


Met 


Asn 


Gly 










one; 

J 3 J 










a n n 










4 05 


Ala 


Cys 


Ser 


ir L U 




Leu 


Leu 


Pro 


Thr 


Leu 


Ser 


Ala 


Pro 


Met 


Pro 








*i 1 U 










415 










4 2 0 


Phe 


Pro 


Leu 


tr L V 


Val 


val 


Pro 


As p 


ly 1 


Ser 


Ser 


Ser 


Thr 


xy 1 


Leu 










4 9 R 










43 0 










435 


Pne 


Arg 


Leu 


Mot 


.rt 1 a 


V a. J. 


Val 


Val 


His 


His 


Gly 


Asp 


Met 


His 


Ser 








a a n 

U 










*i *± J 










*i -J U 


Gly His Phe Val Thr Tyr Arg Arg Ser Pro Pro Ser Ala Arg Asn 
455 460 465 

Pro Leu Ser Thr Ser Asn Gln Trp Leu Trp Val Ser Asp Asp Thr 
470 475 480 

Val Arg Lys Ala Ser Leu Gln Glu Val Leu Ser Ser Ser Ala Tyr 
485 490 495 






Leu Leu Phe Tyr Glu Arg Val Leu Ser Arg Met Gln His Gln Ser 
500 505 510 

Gln Glu Cys Lys Ser Glu Glu 
515 

<210> 13 
<211> 346 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 5853753CD1 

<400> 13 



Met Val Glu Lys Glu Glu Ala Gly Gly Gly Ile Ser Glu Glu Glu 
1 5 10 15 

Ala Ala Gln Tyr Asp Arg Gln Ile Arg Leu Trp Gly Leu Glu Ala 
20 25 30 

Gln Lys Arg Leu Arg Ala Ser Arg Val Leu Leu Val Gly Leu Lys 
35 40 45 

Gly Leu Gly Ala Glu Ile Ala Lys Asn Leu Ile Leu Ala Gly Val 
50 55 60 

Lys Gly Leu Thr Met Leu Asp His Glu Gln Val Thr Pro Glu Asp 
65 70 75 

Pro Gly Ala Gln Phe Leu Ile Arg Thr Gly Ser Val Gly Arg Asn 
80 85 90 

Arg Ala Glu Ala Ser Leu Glu Arg Ala Gln Asn Leu Asn Pro Met 
95 100 105 

Val Asp Val Lys Val Asp Thr Glu Asp Ile Glu Lys Lys Pro Glu 
110 115 120 

Ser Phe Phe Thr Gln Phe Asp Ala Val Cys Leu Thr Cys Cys Ser 
125 130 135 

Arg Asp Val Ile Val Lys Val Asp Gln Ile Cys His Lys Asn Ser 
140 145 150 

Ile Lys Phe Phe Thr Gly Asp Val Phe Gly Tyr His Gly Tyr Thr 
155 160 165 

Phe Ala Asn Leu Gly Glu His Glu Phe Val Glu Glu Lys Thr Lys 
170 175 180 

Val Ala Lys Val Ser Gln Gly Val Glu Asp Gly Pro Asp Thr Lys 
185 190 195 

Arg Ala Lys Leu Asp Ser Ser Glu Thr Thr Met Val Lys Lys Lys 
200 205 210 

Val Val Phe Cys Pro Val Lys Glu Ala Leu Glu Val Asp Trp Ser 
215 220 225 

Ser Glu Lys Ala Lys Ala Ala Leu Lys Arg Thr Thr Ser Asp Tyr 
230 235 240 

Phe Leu Leu Gln Val Leu Leu Lys Phe Arg Thr Asp Lys Gly Arg 
245 250 255 

Asp Pro Ser Ser Asp Thr Tyr Glu Glu Asp Ser Glu Leu Leu Leu 
260 265 270 

Gln Ile Arg Asn Asp Val Leu Asp Ser Leu Gly Ile Ser Pro Asp 
275 280 285 

Leu Leu Pro Glu Asp Phe Val Arg Tyr Cys Phe Ser Glu Met Ala 
290 295 300 

Pro Val Cys Ala Val Val Gly Gly Ile Leu Ala Gln Glu Ile Val 
305 310 315 

Lys Ala Leu Ser Gln Arg Asp Pro Pro His Asn Asn Phe Phe Phe 
320 325 330 

Phe Asp Gly Met Lys Gly Asn Gly Ile Val Glu Cys Leu Gly Pro 
335 340 345 

Lys 

<210> 14 

<211> 151 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 411344CD1 

<400> 14 



Mo t- 


A 1 3 




Met 


Gin 


Lys 


Arg 


Leu 


Gin 


Lys 


Glu 


Leu 


Leu 


Ala 


Leu 


1 
















10 










15 


Gin 


Asn 


- 

Asp 


Pro 


Pro 


Pro 


G 1 v 


Met 


Thr 


Leu 


Asn 


Glu 


Lys 


Ser 


Val 








2 0 










25 










30 


bin 


Asn 


OCX. 


Tip 
lie 


Thr 


Gin 




He 


Val 


Asp 


Met 


GlU 


Glv 


Ala 


Pro 










3 5 










40 










45 




Thr 


Leu 


Tvr 
1 j L 


Glu 


Gly 


Glu 


Lys 


Phe 


Gin 


Leu 


Leu 


Phe 


Lys 


Phe 






5 0 










55 










60 


Ser 


Ser 




TVr 


Pro 


Phe 


Asp 


Ser 


Pro 


Gin 


Val 


Met 


Phe 


Thr 


Glv 






65 










7 0 










75 


Glu Asn Ile Pro Val His Pro His Val Tyr Ser Asn Gly His Ile 
80 85 90 

Cys Leu Ser Ile Leu Thr Glu Asp Trp Ser Pro Ala Leu Ser Val 
95 100 105 

Gln Ser Val Cys Leu Ser Ile Ile Ser Met Leu Ser Ser Cys Lys 
110 115 120 

Glu Lys Arg Arg Pro Pro Asp Asn Ser Phe Tyr Val Arg Thr Cys 
125 130 135 
Tyr 
Arg Arg 
Pro 
160 
Asp 
Gln 
Met Ser Leu 
Glu Asp 
Val Arg 
Val Gln 
210 
Val Thr Val Cys Asn Gln 
Tyr 
Leu Asn Glu Asp Tyr Lys Thr Val Asn 
575 

Val Lys Gly Ser Asp Gly Phe Trp Val 
590 

Ser Trp Arg Gln Leu Ala Leu Glu Gln 
605 

Asp Ala Glu Gln Ser Asn Gly Lys Met 
620 

Lys Asp Glu Ser Lys Glu Glu Arg Lys 
635 

Phe Asn Glu Asp Ile Leu Cys Pro His 
650 

Glu Asn Glu Arg Arg Leu Val Ser Lys 
665 

Gln Gln Tyr Phe Pro Lys Ala Pro Glu 
680 

Cys Cys Ser Gln Cys Lys Ile Leu Glu 
695 

Glu Ala Leu His Lys Met Ile Ala Asn 
710 

Pro Asn Leu Phe Gln Asp Lys Asn Arg 
725 

Pro Glu Asp Thr Asp Val Leu Tyr Ile 
740 

Glu Glu Trp Arg Lys Phe Val Arg Lys 
755 

Val Ser Ser Val Gly Asn Ser Ala Leu 
770 

Leu Met Phe Thr Phe Ala Ser Met Thr 
785 

Ile Ala Leu Ile Trp Pro Ser Glu Trp 
800 

Phe Val Val Asp His Val Ile Lys Ile 
815 

Asp Val Asn Pro Ser Glu Thr Gln Tyr 
830 

Cys Pro Glu Cys Arg Glu Gly Leu Leu 
845 

Leu Arg Glu Tyr Thr Gln Ala Thr Ile 
860 

Asp Asn Lys Lys Val Met Lys Asp Ser 
875 

Ser Ser Ser Glu Thr Glu Glu Asp Lys 
890 

Gly Glu Lys Asp Pro Asp Phe Asn Gln 
905 

Val Ala Pro Phe Asp Gln Asn Leu Ser 
920 

Ser Asp Asp Cys Ala Thr Leu Gly Thr 
935 

Ser Val Ile Leu Leu Lys Ala Asp Glu 
950 

Ala Met Asp Asp Val Met Gln Val Cys 
965 

Lys Gly Thr Gly Leu Leu Gly His 
980 
Arg Leu 
Ala 
Ala 
Ile 
Glu 
Thr Leu 
Ile 
Asn Lys 
Leu 
Ile 
Ile 
Ser Thr Val 
Asn Thr 
Val 
j 3 \j 
Arg Val 
Leu 
Met Lys 
J 1 J 
Ile Trp 
Asn Lys 
Tin 
j j u 
Leu Ile 
345 
Pro Phe Val Val Asp 
355 
^ fi n 
Asn Gly Leu Lys Lys 
Tyr 
Ser Ser 
Gln Gln Pro 

Met Ser Lys Ala Phe Gly Leu 
1 5 

Leu Ala Glu Ser Ser Gln Ser 
20 

Glu Glu Asp Ser Asn Met Lys 
35 

Arg Ala Trp Asp Tyr Pro His 

Gln Thr Cys Cys Leu Asn 
Val Asp Phe Thr Arg 
Arg 
Lys Met 
Cys 
Lys 
Ile 
Met 
Arg 
Ser 
Phe 
Val 
Cys 
Leu Arg 
Ile Cys Gln Ser Ile 
15 
Glu Glu Lys Lys 
Arg 
Arg 
Pro 
Leu 
Met 
100 105 

Ser Arg Gln Lys Ala Val Arg Pro 
215 220 
T \/c 


rye 


Phe 
v-y s 
Gly 


Lys 


Lys 


Thr 


Arg 


Glv 


LVS 






23 0 










235 










240 


bin 


V a. ± 


Leu 


Lys 




Thr 


His 


Leu 


Pro 


Gin 


Thr 


Leu 


Thr 


He 


His 








245 










250 










255 


LeU 


Mat" 


Aircj 


Phe 


Ser 
260 


He 


Arg 


Asn 


Ser 


Gin 
265 


Thr 


Arg 


Lys 


He 


Cvs 
270 


ri 1 s 


Cor 


Leu 


1 y 1. 


Phe 


Pro 


Gin 


Ser 


Leu 


Asp 


Phe 


ser 


Gin 


He 


Leu 








275 










280 










285 


Pi*o 


Mot" 


Lys 
Ala 
Glu 


Gin 


Ser 


Glv 
Trn 
340 
Thr 


Tyr 
Trp 


Gin 


Glu 








350 










355 










360 


Thr 


Ala 


Tyr 


Leu 


Leu 


Val 


Tyr 


Met 
<210> 21 

<211> 94 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 3092341CD1 

<400> 21 

Met Leu Arg Gly Val Leu Gly Lys Thr Phe Arg Leu Val Gly Tyr 
1 5 10 15 

Thr Ile Gln Tyr Gly Cys Ile Ala His Cys Ala Phe Glu Tyr Val 
20 25 30 

Gly Gly Val Val Met Val Pro Met Gly His Val Trp Leu Glu Gly 
35 40 45 

Asp Asn Leu Gln Asn Ser Thr Asp Ser Arg Cys Tyr Gly Pro Ile 
50 55 60 

Pro Tyr Gly Leu Ile Arg Gly Arg Ile Phe Phe Lys Ile Trp Leu 
65 70 75 

Leu Ser Asp Phe Gly Phe Leu Arg Ala Ser Pro Asn Gly His Arg 
80 85 90 

Phe Ser Asp Asp 

<210> 22 
<211> 248 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3658034CD1 

<400> 22 

Met Asn Thr Glu Arg Thr Asn Ile Gln Val Thr Val Thr Gly Pro 
1 5 10 15 

Ser Ser Pro Ser Pro Val Lys Phe Leu Ile Asp Thr His Asn Arg 
20 25 30 

Leu Leu Leu Gln Thr Ala Glu Leu Ala Val Val Gln Pro Thr Ala 
35 40 45 

Val Asn Ile Ser Ala Asn Gly Phe Gly Phe Ala Ile Cys Gln Leu 
50 55 60 

Asn Val Val Tyr Asn Val Lys Ala Ser Gly Ser Ser Arg Arg Arg 
65 70 75 

Arg Ser Ile Gln Asn Gln Glu Ala Phe Asp Leu Asp Val Ala Val 
80 85 90 
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Val 
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Val 
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Ala 
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Arg 
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lip 


Asp 


275 


Val 


Leu 


Ala 


Val 


Gin 
280 


Ser 


Leu 


Tvr 


Glv 


Lys 
285 


Pro 


Leu 


pi v 
u ly 


Gly 


C q v- 


Val 


Ala 


Val 


Gin 


Leu 


Pr o 


Glv 


Lys 


Leu 


Phe 








290 










295 










300 


1 i i L 


Asp 


Phe 




i in 

305 


lip 


A SD 


Ser 


Tvr 


Ser 
310 


Pro 


Gin 


Glv 

° x y 


Arg 


Arg 
315 


Drn 


r>l n 

\j 1 U. 


Thr 


Gin 


G ly 
320 


Pro 
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Tvr 
340 
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Lys 
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Ser 
345 


Hie 
tils 


ph o 
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lip 
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Val 
350 


Ala 


Ala 


Asp 


Gl V 


Asn 
355 


Val 


Ser 


Glu 


Pro 


Arg 
360 


Pro 


LeU 


Gin 


G lu 


365 


Trn 
lip 


Val 


Gly 


Leu 


Pro 
370 


Pro 


Asn 


He 


Glu 


Ala 
375 






v d 1 


Ser 


380 


Asn 




Gly 


Asp 


Phe 
385 


Tin- 

^■y 


Phe 


Phe 


Lys 


Gly 
390 




Arg 1 


Cys 


irp 


Ar cj 
395 


rl lfcr 


Arg 


Gly 


Pro 


Ly s 
400 


Pr o 


Val 




Gly 


Leu 
405 


Pro 


Pin 

bin 


Leu 


Cys 


Arcj" 
410 


nla 


ply 


Gly 




Pr n 

415 


Arg 


His 


Pro 


Asp 


Ala 
420 


Til -a 


LiGU 


Dries 
rile 




C 1 KJ 

425 


Pro 




Arg 


Arg 


Leu 
43 0 


He 


Leu 


Phe 


Lys 


Gly 
435 


nla 


Ar cj 


Tyr 


Tyr 


Val 
440 


Leu. 


Ala 


Arg 


Gly 


Gly 
445 


Leu 


G In 


Val 


Glu 


Pro 
450 


Tyr 


Tyr 


Pro 


Arg 


Ser 
455 




G In 


Asp 


Trn 
lip 


Gly 
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Gly 
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Pro 


Glu 


Glu 
465 
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Gly Ala 


Leu. 


Dm 
.r 1 LJ 


Arg 
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He 


He 


Phe 


Phe 
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Asp 


Arg 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 4993873CD1 

<400> 24 
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Val 
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Gin 
20 
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Leu 


Leu 


Ala 


His 
25 
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Lys 


Ser 


Leu 
30 


Gin 
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Pro 


Gin 


Pro 


Pro 


Arg 


His 


Gin 


Leu 


Ser 


Glu 


Pro 


Ala 


Pro 








35 










40 










45 


Ala Tyr His Arg Ile Thr Pro Thr Ile Thr Asn Phe Ala Leu Arg 
50 55 60 

Leu Tyr Lys Glu Leu Ala Ala Asp Ala Pro Gly Asn Ile Phe Phe 
65 70 75 

Ser Pro Val Ser Ile Ser Thr Thr Leu Ala Leu Leu Ser Leu Gly 
80 85 90 

Ala Gln Ala Asn Thr Ser Ala Leu Ile Leu Glu Gly Leu Gly Phe 
95 100 105 

Asn Leu Thr Glu Thr Pro Glu Ala Asp Ile His Gln Gly Phe Arg 
110 115 120 

Ser Leu Leu His Thr Leu Ala Leu Pro Ser Pro Lys Leu Glu Leu 
125 130 135 

Lys Val Gly Asn Ser Leu Phe Leu Asp Lys Arg Leu Lys Pro Arg 
140 145 150 

Gln His Tyr Leu Asp Ser Ile Lys Glu Leu Tyr Gly Ala Phe Ala 
155 160 165 
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Phe 


i nr 


Asp 


S sr 


Val 










1 1 U 










Asn 


Asp 
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G lu 
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Gin 
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i nr 












200 










lie 


Phe 


Phe 


Lys 


Ala 


Lys 


Trp 


Lys 
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Thr 


Gin 


Lys 


Gin 


Ala 
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Phe 


Phe 


Val 










230 










Gin 


Val 


Pro 


Met 


Met 


His 


Gin 


Lys 
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Gin 


Asp 


Leu 


Ala 


Cys 


Thr 


Va 1 


Leu 










2 6 0 










Asn 


Ala 


Leu 


Ala 


Leu 

~) H C 


Leu 


Val 


Leu 


Pro 


Gin 


Val 


Glu 


Ala 


J / D 

Ala 


Leu 


Gin 


Pro 


bin 










2 9 0 










Gin 


Leu 


Leu 


Leu 


Pro 


Ser 


Leu 


Leu 


Asp 










one 
J U b 










Ser 


lie 


Ser 


Gly 


Thr 


Tyr 


Asn 


Leu 


Glu 










3 2 0 










Gly 


Leu 


Thr 


Asn 


lie 


Leu 


Asn 


Leu 


Glu 








"3 1 C 
J J J 










Thr 


Gly 


Gin 


Leu 


Asn 


Lys 


Thr 


lie 


Ser 








3 50 










Met 


Val 


Asp 


Met 


Ser 


Glu 


Lys 


Gly 


my, v- 

i nr 










3 65 










Gly 


Leu 


Leu 


Ser 


Gin 


Pro 


Pro 


Ser 


Leu 








380 










His 


Ala 


His 


Phe 


Asn 


Arg 


Pro 


Phe 


Leu 










395 










Thr 


Thr 


Gin 


Ser 


Leu 


Leu 


Phe 


Leu 


Gly 










410 











Ala Gly 



<210> 25 

<211> 114 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 5208004CD1 

<400> 25 



Met 


Arg 


Trp 


Arg 


Gin 
5 


Arg 


Ser 


Phe 


Leu 


1 

Ser 


Leu 


Arg 


Gly 


Gly 


Gin 


His 


His 


Pro 










20 










Ala 


Ser 


Ser 


Leu 


Pro 


Phe 


Ser 


Thr 


Leu 










35 










Ser 


Leu 


Ser 


Cys 


Cys 


Leu 


Val 


Ser 


Pro 










50 










Ser 


Phe 


Val 


Leu 


Leu 


Pro 


Trp 


Pro 


Pro 










65 










Ala 


Pro 


Ser 


Pro 


Pro 


Arg 


Gly 


He 


His 










80 










Gly 


Trp 


Gly 


Ser 


Pro 


Ala 


Gly 


Phe 


Leu 










95 










Ser 


Ala 


Ala 


Val 


He 


Phe 


Gly 


Leu 


Ser 










110 











<210> 26 
<211> 742 
<212> PRT 

<213> Homo sapiens 






1 1 11 


Tri r- 


Gly 


Arg 


Gin 


lie 


1 / J 










180 


\j ± y 


Gin 


Val 


Val 


Asp 


Cvs 
j 


i on 










195 


Mot" 


Val 


Leu 


Ala 


Asn 


Tvr 


one; 










210 


Pro 




Ser 


Arg 


i y i 


Gin 


~> 9 n 

Z w 










2 2 5 


Asp 


Glu 


Arg 


Thr 


Ser 


Leu 


tic: 
-j -J 










240 


Mot" 


Hie: 


Arg 


Phe 


Leu 


Tvr 


o ^ n 










255 


bin 


lie 


vj 1U 


TV/ r 

iyr 


Arg 


G ly 


o r 










27 0 


Asp 


Pro 


iy 


T \/c 
ijy a 


Meat - 
rifcr u 


Lys 












28 5 


Thr 


Leu 


Arg 


Lys 


i rp 


Gly 












3 00 


Leu 


hi 1 S 


Leu 


Pro 


Arg 




Tin 
j ± u 










315 


Asp 


lie 


Leu 


Pro 


pin 
bill 


Tip 


J Z D 










3 3 0 


Ala. 


Asp 


rljc 


Ser 


biy 


Val 


"5 <^ n 
J 4 U 










345 


Lys 


val 


Ser 


Ui c 
ri 1 t> 


Lys 


r\ ± d 


~i c a 

JJJ 










J D v 


Pin 


Ala 
nla 


biy 


7V 1 = 


Ala 

e\±cx 


Ser 


J / U 










-lie 


Asn 


Thr 


Met 


Ser 


Asp 


Pro 


385 










390 


Leu 


Leu 


Leu 


Trp 


Glu 


Val 


400 










405 


Lys 


Val 


Val 


Asn 


Pro 


Val 


415 










420 


Leu 


Arg 


Leu 


r/ne 


Leu 


C 1 \r 


i n 
1 u 










15 


Pro 


Leu 


i nr 


Leu 


ri u 


Ser 












30 


i?er 


Leu 


Leu 


Leu 




Ser 


4 0 










45 


Cys 


irl w 


Lys 




Pro 


Gly 












60 


Pro 


Arg 


Arg 


Arg 


Ser 


Gin 


70 










75 


Thr 


Thr 


Gly 


Ser 


Cys 


Trp 


85 










90 


Met 


Pro 


Cys 


Ala 


Gin 


Gly 


100 










105 
<220> 

<221> misc_feature 

<223> Incyte ID No : 

<400> 26 



Met 


Pro 


Ala 


Gly 


Gly 


i 








D 


Arg 


Val 


Pro 


Asn 


Arg 

2 0 


Gin 


His 


Ala 


Tyr 


Asn 
3 5 


Ala 


Phe 


Asp 


Lys 


Ala 
R n 


Val 


Gly 


Tyr 


Ser 


Thr 

D D 


Ser 


Phe 


Gin 


Asn 


Glu 

ft C\ 


Val 


Ser 


Val 


Lys 


Val 


Val 


Tyr 


Met 


Thr 


Phe 
1 1 n 


Pro 


Met 


Asn 


Val 


Trp 


Gly 


Thr 


Tyr 


Phe 


Pro 

J.fi u 


Arg 


Thr 


Val 


Leu 


Leu 

ICC 


Asn 


Thr 


Leu 


Leu 


Glu 
1 / U 


Ala 


Arg 


Ser 


Glu 


He 

IOC 

lob 


Ala 


Ala 


Thr 


Val 


Asn 
20 0 


Tyr 


Asp 


Glu 


Glu 


Tyr 

zlb 


Thr 


Pro 


Val 


He 


Leu 
2. J U 


Arg 


Leu 


Thr 


Gin 


Asp 
245 


inr 


Leu 


Lys 


Met 


i v ltr t_ 

o £ n 


Gin 


Gly 


Phe 


His 


Arg 


His 


Phe 


Glu 


Lys 


Met 
o q n 


Tyr 


Ser 


Gin 


Ala 


Phe 








t n c: 


Val 


Ala 


Lys 


Gly 


He 

Ton 
-> £ \) 


Arg 


Ser 


Gly Gly 


Phe 










3 3 5 


Glu 


Arg 


Gly 


Gin 


Arg 
350 


Val 


Lys 


Glu 


Val 


Gin 
365 


Thr 


Glu 


Pro 


Leu 


Thr 

380 


Leu 


Thr 


Glu 


Ala 


Gly 
395 


Glu 


Leu 


Gin 


Gly 


Gin 
410 


Leu 


Thr 


Ala 


Ala 


Arg 
425 


Leu 


Leu 


Asn 


Ser 


Gly 
440 


Arg 


Pro 


Lys 


Pro 


His 
455 
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Lys 


Gly 


Ser 


His 


Pro 










10 


Leu 


He 


His 


Glu 


Lys 










25 


Pro 


Va 1 


Asp 




r T\/r 
j 










40 


Arg 


Liys 


Glu 


Asn 


Lys 










55 


Cys 




i rp 




His 










7 0 


Glu 


I le 


Gly 


Arg 


Leu 
a r 


Asp 


Arg 


pi n 

\j 1. u. 


biu 


O D 

Arg 










100 


V cl X 


^ j. ii 


nld 


111J- 


Ser 










I 1 R 

I I j 


Leu 


1 111 


O v- r\ 


- 

Asn 


Leu 










i m 

IjU 


Pro 


Glu 


ASp 


Gly 


Leu 










TAR 

l 1 ! J 


Arg 


He 


Arg 


pin 
\jlU 


fin 










i en 
lOU 


Asn 


Ser 


bin 


Arg 


v a J. 










l / b 


Ser 


Val 


vjiy 


Asp 


Arg 










ion 


Asn 


Arg 


t_ys 


IrXlfc: 


Pin 
Hi 










ZUJ 


Gly Gly 


relief 


Ala 


Pill 
bill 










•inn 
Z Z U 


Ser 


Phe 


Leu 


Irnc 


Ser 










J 


Gly 


Ser 


Arg 


/via 


Pin 

bin 










Z D U 


Ala 


Asn 


\j ± y 


C 1 \s 


Tip 
11c 










TCC 
ZD J 


Tyr 


Ser 


1 11 L 


Asp 


Arg 










9 ft n 


Leu 


Tyr 


Asp 


Pin 


a 1 a 
/\1 d 










OQC: 
Z7 J 


Gin 


Leu 


e 


Gly 












3 i? 


Leu 


Gin 


iy r 


Val 


Ala 










325 


Tyr 


Ser 


Ala 


Glu 


Asp 










340 


Pro 


Lys 


Glu 


Glv 


Ala 










3 55 


Gin 


Leu 


Leu 


Pro 


Glu 










370 


Ser 


Gly 


Gin 


Leu 


Leu 










3 85 


Asn 


He 


Ser 


Pro 


Ser 










400 


Asn 


Val 


Leu 


Thr 


Val 










415 


Phe 


Gly 


Leu 


Asp 


Val 










430 


Leu 


Glu 


Lys 


Leu 


Phe 










445 


Leu 


Asp 


Ser 


Lys 


Met 










460 



Ser 


Ser 


Thr 


Pro 


Gin 
15 


Ser 


Pro 


Tyr 


Leu 


Leu 

30 


Pro 


Trp 


Gly 


Glu 


Glu 
45 


Pro 


He 


Phe 


Leu 


Ser 
60 


Met 


Met 


Glu 


Glu 


Glu 
7 5 


Leu 


Ser 


Glu 


Asp 


Phe 

90 


Pro 


Asp 


Val 


Asp 


Lys 
105 


Ser 


Gly 


Gly 


Gly 


Trp 
120 


Gin 


Pro 


Phe 


Val 


Gly 
13 5 


Thr 


Arg 


Val 


Gly 


Phe 
150 


Trp 


Lys 


Gin 


Asn 


Lys 
165 


Thr 


Thr 


Ala 


Leu 


Leu 
i fin 

1 O \J 


Gin 


Leu 


Pro 


Pro 


Ser 
19 5 


Gin 


Leu 


Asp 


Glu 


Gly 

910 


Ala 


Pro 


Lys 


Phe 


Pro 

^ ^ j 


Tyr 


Trp 


Leu 


Ser 


His 
240 


Gin 


Met 


Ala 


Leu 


His 

^ j j 


Arg 


Asp 


His 


Val 


Gly 
27 0 


Gin 


Trp 


His 


Val 


Pro 
285 


Gin 


Leu 


Ala 


Val 


Ala 
300 


Glu 


Phe 


Tyr 


Ser 


Asp 
315 


Arg 


Ser 


Leu 


Ser 


His 
33 0 


Ala 


Asp 


Ser 


Pro 


Pro 
345 


Tyr 


Tyr 


Val 


Trp 


Thr 
360 


Pro 


Val 


Leu 


Gly 


Ala 

375 


Met 


Lys 


His 


Tyr 


Gly 
390 


Gin 


Asp 


Pro 


Lys 


Gly 
405 


Arg 


Tyr 


Ser 


Leu 


Glu 
420 


Glu 


Ala 


Val 


Arg 


Thr 
435 


Gin 


Ala 


Arg 


Lys 


His 
450 


Leu 


Ala 


Ala 


Trp 


Asn 
465 
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Gly 


Leu 


Met 


Val 


Ser 


Gly 


Tyr 


Ala 


val 








470 










Gin 


Asp 


Arg 


Leu 


He 


Asn 


Tyr 


Ala 


i nr 








485 










Lys 


Arg 


His 


Met 


Phe 

500 


Asp 


Val 


Ala 


ber 


Cys 


Tyr 


Thr 


Gly 


Pro 
515 


Gly 


Gly 


Thr 


val 


Cys 


Trp 


Gly 


Phe 


Leu 
53 0 


Glu 


Asp 


Tyr 


Ala 


Leu 


Asp 


Leu 


Tyr 


Glu 
545 


Ala 


Ser 


Gin 


GlU 


Ala 


Leu 


Arg 


Leu 


Gin 
560 


Asp 


Thr 


Gin 


Asp 


Gin 


Gly 


Gly Gly 


Tyr 


Phe 


Cys 


Ser 


Glu 










575 










Leu 


Pro 


Leu 


Arg 


Leu 
590 


Lys 


Asp 


Asp 


Gin 


Ala 


Asn 


Ser 


Val 


Ser 
605 


Ala 


His 


Asn 


Leu 


Thr 


Gly 


His 


Lys 


Asp 

620 


Trp 


Met 


Asp 


Lys 


Ala 


Phe 


Ser 


Glu 


Arg 
63 5 


Met 


Arg 


Arg 


Val 


Met 


Val 


Arg 


Ala 


Leu 
650 


Ser 


Ala 


Gin 


Gin 


Val 


He 


Cys 


Gly 


Asp 
665 


Arg 


Gin 


Ala 


Lys 


Gin 


Cys 


Val 


His 


Ser 


Val 


Tyr 


He 


Pro 








680 










Ala 


Asp 


Gly 


Asp 


Pro 
695 


Ser 


Ser 


Phe 


Leu 


Leu 


Ser 


Thr 


Leu 


Arg 
710 


Arg 


Leu 


Glu 


Asp 


Cys 


Glu 


Asn 


Gin 


Ala 


Cys 


Ser 


Val 


Pro 








725 










Leu 


Arg 


Lys 


Leu 


Leu 
740 


His 


Pro 







<210> 27 

<211> 734 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 5583922CD1 

<400> 27 



Met 


Trp 


Gly 


Leu 


Leu 
5 


Leu 


Ala 


Leu 


Ala 


1 

Gly 


Pro 


Ala 


Leu 


Gly 


Ala 


Pro 


Arg 


Asn 










20 










Gin 


Pro 


Gly 


Thr 


Thr 


Lys 


Val 


Pro 


Gly 










35 










Ser 


Ser 


Pro 


Ala 


Gin 


Pro 


Pro 


Ala 


Glu 










50 










Glu 


Gin 


His 


Val 


Arg 


He 


Arg 


Val 


He 










65 










Met 


Lys 


Lys 


Arg 


Lys 


Lys 


Leu 


Thr 


Leu 










80 










Val 


Thr 


Ala 


Gly 


Pro 


Leu 


Val 


Thr 


Pro 










95 










Asp 


Pro 


Ala 


Glu 


Lys 


Gin 


Glu 


Thr 


Gly 










110 










GlU 


Ser 


Leu 


Arg 


Val 


Ser 


Asp 


Ser 


Arg 










125 










Gin 


Ser 


Phe 


Gly 


Leu 


Gly 


Pro 


His 


Arg 
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T'Vi v- 

i nr 


pi 

La i y 


f-\ -LCX 


v a L 


Leu 


Gly 


4 / D 










4 80 


Asn 


\j xy 


Ala 

Ala 


Lys 




Leu 


/ion 

4 y if 










4 9 5 




Ar 9 


Leu 


Me t 


Arg 


Thr 


b U d 










D L U 


pin 

Ola 


LI -i e? 
HIS 


Q or- 
o C X 


Asn 


D y r\ 
XT L <J 


r^i u 


jZU 










525 


Phe 


val 


V a 1 


Arg 






JJ J 










54 0 


Ser 


Ala 


LLp 


Leu 


Glu 


Trn 
l l y 


J JU 










555 


Arg 


Leu 


riie 


Trp 


Asp 


Co Y" 












57 0 


Ala 


blU 


Lgu 


biy 


Ala 




con 
bo u 












Asp 


Gly 


A ~\ = 

Ala 


Glu 


Pro 


Ser 


c n c 

b y b 










DUU 


Leu 


Arg 


Leu 


HIS 


pi ./ 


t^lltr 


610 










£1 C 
Di J 


Cys 


val 


Cys 


Leu 


Leu 


11)1 


c o c 

0 Z J 










~k n 

D J U 


Pro 


val 


Ala 


Leu 


Pro 


blU 


640 










04 D 


Gin 


Thr 


Leu 


Lys 


pin 


Tl « 

lie 


655 










DDL) 


Asp 


Thr 


Lys 


Ala 


Leu 


va i 


D / L) 










O / _> 


Asn 


Lys 


val 


Leu 


Tl a 

lie 


Leu 


DOj 










£ q n 

D j u 


Ser 


Arg 


Gin 


Leu 


Pro 


Phe 


700 










705 


Gin 


Ala 


Thr 


Ala 


Tyr 


Val 


715 










720 


He 


Thr 


Asp 


Pro 


Cys 


Glu 


730 










735 


Ala 


Phe 


Ala 


Pro 


A X a 


V ax 


10 










J. J 


Ser 


va l 


Lgu 


o ly 


Leu 


n± a 


Z D 










3 0 


Ser 


inr 


Pro 


A I -a 

Ala 


Leu 


Hie: 
nib 


a n 

4 U 












i nr 


Ala 


Asn 




TV) r- 
llll 


Ser 


_> _> 










6 0 


Lys 


Lys 


Lys 


Lys 


v a x 


He 


7 fi 
/ u 










75 


i nr 


Arg 


XT L D 


1111. 




Leu 


O _) 










9 0 


Thr 


Pro 


A 1 -a 
Aid 


nit/ 


1 il 1 




100 










105 


Cys 


Pro 


Pro 


Leu 


Gly 


Leu 


115 










120 


Leu 


Glu 


Ala 


Ser 


Ser 


Ser 


130 










135 


Gly 


Arg 


Leu 


Asn 


He 


Gin 
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140 










Ser 


Glv 


Leu 


Glu 


Asp 
155 


Gly 


Asp 


Leu 


Tyr 


Glu 


Glu 


Gin 


Asp 


Ala 
170 


Asp 


Pro 


Trp 


Phe 


Pro 


Thr 


Arg 


Phe 


Ser 
185 


GlV 


Val 


He 


Thr 






Tvr 


Asp 


200 


Val 


Thr 


Ser 


Tyr 


Asp 


Ser 


Arg 


Thr 


Tro 
215 


TrD 


Gly 


Ser 


Arg 


Asp 


Ala 


Val 


Phe 


Pro 


Ala 


Asn 


Ser 


Asp 








230 










As n 


Leu 


Leu 


Pro 


Glu 
245 


Pro 


Gin 


Val 


Ala 


Pro 


Gin 


Thr 




Leu 
260 


Gin 


Glv 


Gly 


Ala 


lie 




Ala 


fvq 
uy is 


Pro 
275 


Val 


Ser 


Asp 


Pro 




Pro 


Ala 


Ser 


Glv 
290 


Ser 


Ser 


Asp 


Pro 


nail 




Lys 


Ala 


Met 
305 


Arg 


Lys 


Leu 


Met 




Pro 


Asn 


lie 


Thr 


Arg 


lie 


Tvr 


Ser 








320 










Pi -vr 


Leu 


ijy 


Leu 


Tvr 
335 


Val 


Met 


Glu 


Met 


nib 


Glu 


Leu 


Gly 


GlU 
350 


Pro 


Glu 


Val 


Arg 


Gly 




Glu 


Ala 


Leu. 


Glv 


Arg 


Glu 


Leu 








365 












L6U 




His 


Glu 
380 


Phe 


Leu 


Arg 


Glv 


Leu. 


LeU 


Ser 


Glu 


Met 
395 


Arg 


lie 


His 


Leu 


Asp 




iyi 


Glu 


lie 
410 


Ala 


Tvr 

± y L. 


His 


Arg 


Trp 


rv-Lcl 


Glu 


Gly 


Arg 
425 


Trn 


Asn 


Asn 


Gin 


Asn 


Phe 


Ala 


Asp 


Leu 
440 


Asn 


Thr 


pro 


Leu 


pi \r 


Lys 


Val 


Pro 


His 


lie 


Val 


Pro 


Asn 






455 










1 L1L 


xyi 


TVT" 

xy i. 


Thr 


Leu 


Pro 


Asn 


Ala 


Thr 








470 










Ala 


Val 


lie 


Lys 


485 


Met 


Lys 


Ara 


He 




Leu 


His 


Glv 

j.y 


Glv 
500 


Glu 


Leu 


Val 


Val 


Thr 


Arg 


Thr 


Pro 


TrD 
515 


Ala 


Ala 


Arg 


Glu 


Asp 


Ala 


Val 


Phe 


Arg 
530 


Trp 


Leu 


Ser 


Thr 


Leu. 


Ala 


Met 


Gin 


Asp 
545 


Thr 


Ser 


Arg 


Arg 


Phe 


Ser 


Val 


His 


Gly 
560 


Asn 


He 


He 


Asn 


Val 


Pro 


Glv 


Ser 


Met 
575 


Asn 


Asp 


Phe 


Ser 


Phe 


Glu 


Val 


Thr 


Val 
590 


Glu 


Leu 


Ser 


Cys 


Asn 


Glu 


Leu 


Pro 


Gin 
605 


Glu 


Trp 


Glu 


Asn 


Thr 


Tyr 


Leu 


Glu 


Gin 
620 


Val 


Arg 


Met 


Gly 


Asp 


Lys 


Asp 


Thr 


Glu 
635 


Leu 


Gly 


He 


Ala 
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145 










150 


Asp 


Gly 


Ala 


Trp 


Cys 


Ala 


160 










165 


Gin 


Val 


Asp 


Ala 


Gly 


His 


175 










180 


Gin 


Gly 


Ser 


Asn 


Ser 


Val 


190 










195 


Lys 


Val 


Gin 


Phe 


Ser 


Asn 


205 










210 


Asn 


His 


Ser 


Ser 


Gly 


Met 


220 










225 


Pro 


Glu 


Thr 


Pro 


Val 


Leu 


235 










240 


Arg 


Phe 


He 


Arg 


Leu 


Leu 


250 










255 


Pro 


Cys 


Leu 


Arg 


Ala 


Glu 


265 










270 


Asn 


Asp 


Leu 


Phe 


Leu 


Glu 


280 










285 


Leu 


Asp 


Phe 


Gin 


His 


His 


295 










300 


Lys 


Gin 


Val 


Gin 


Glu 


Gin 


310 










315 


He 


Gly 


Lys 


Ser 


Tyr 


Gin 


325 










330 


Ser 


Asp 


Lys 


Pro 


Gly Glu 


340 










345 


Tyr 


Val 


Ala 


Gly 


Met 


His 


355 










360 


Leu 


Leu 


Leu 


Leu 


Met 


Gin 


370 










375 


Asn 


Pro 


Arg 


Val 


Thr 


Ara 


385 










390 


Leu 


Pro 


Ser 


Met 


Asn 


Pro 


400 










405 


Gly 


Ser 


Glu 


Leu 


Val 


Gly 


415 










420 


Ser 


He 


Asp 


Leu 


Asn 


His 


430 










435 


Trp 


Glu 


Ala 


Gin 


Asp 


Asp 


445 










450 


His 


His 


Leu 


Pro 


Leu 


Pro 


460 










465 


Val 


Ala 


Pro 


Glu 


Thr 


Arg 


475 










480 


Pro 


Phe 


Val 


Leu 


Ser 


Ala 


490 










495 


Ser 


Tyr 


Pro 


Phe 


Asp 


Met 


505 










510 


Leu 


Thr 


Pro 


Thr 


Pro 


Asp 


520 










525 


Val 


Tyr 


Ala 


Gly 


Ser 


Asn 


535 










540 


Pro 


Cys 


His 


Ser 


Gin 


Asp 


550 










555 


Gly 


Ala 


Asp 


Trp 


His 


Thr 


565 










570 


Tyr 


Leu 


His 


Thr 


Asn 


Cys 


580 










585 


Asp 


Lys 


Phe 


Pro 


His 


Glu 


595 










600 


Asn 


Lys 


Asp 


Ala 


Leu 


Leu 


610 










615 


He 


Ala 


Gly Val 


Val 


Arg 


625 










630 


Asp 


Ala 


Val 


He 


Ala 


Val 


640 










645 
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Asp 


Gly 


lie 


Asn 


His 
650 


Asp 


vai 


Tn r 


±11 L 


Trp 


Arg 


Leu 


Leu 


Thr 


Pro 


Gly 


Asp 


Tyr 








665 










Glu 


Gly 


Tyr 


His 


Ser 


Val 


Thr 


Ar cj 


Asn 








68 0 










Glu 


Gly 


Pro 


Phe 


Pro 
695 


Cys 


Asn 


Phe 


val 


Gin 


Arg 


Leu 


Arg 


Glu 
710 


Leu 


Leu 


Ala 


Ala 


Asp 


Leu 


Arg 


Arg 


Arg 


Leu 


Glu 


Arg 


Leu 



725 

<210> 28 
<211> 2080 
<212> DNA 

<213> Homo sapiens 






Ala 


Trp 


p 1 w 

i y 


Gly 


Asp 


y L 


£ c ; 

D _> -J 










660 




v a ± 


1 I IX 




Qav 


Ala 












675 


Cys 


Arg 


Val 


Thr 


Phe 


Glu 


6 8 5 










690 


LiQJi 




Lys 


J. ill 


Pj-Q 


Lys 


700 










705 


Gly 


Ala 


Lys 


Val 


Pro 


Pro 


715 










720 


Arg 


Gly 


Gin 


Lys 


Asp 




730 













<220> 

<221> misc_feature 

<223> Incyte ID No: 088718CB1 



<400> 28 

tgaaggactt ttccaggacc caaggccaca cactggaagt cttgcagctg aagggaggca 60 

ctccttggcc tccgcagccg atcacatgaa ggtggtgcca agtctcctgc tctccgtcct 120 

cctggcacag gtgtggctgg tacccggctt ggcccccagt cctcagtcgc cagagacccc 180 

agcccctcag aaccagacca gcagggtagt gcaggctccc agggaggaag aggaagatga 240 

gcaggaggcc agcgaggaga aggccggtga ggaagagaaa gcctggctga tggccagcag 300 

gcagcagctt gccaaggaga cttcaaactt cggattcagc ctgctgcgaa agatctccat 360 

gaggcacgat ggcaacatgg tcttctctcc atttggcatg tccttggcca tgacaggctt 420 

gatgctgggg gccacagggc cgactgaaac ccagatcaag agagggctcc acttgcaggc 480 

cctgaagccc accaagcccg ggctcctgcc ttccctcttt aagggactca gagagaccct 540 

ctcccgcaac ctggaactgg gcctctcaca ggggagtttt gccttcatcc acaaggattt 600 

tgatgtcaaa gagactttct tcaatttatc caagaggtat tttgatacag agtgcgtgcc 660 

tatgaatttt cgcaatgcct cacaggccaa aaggctcatg aatcattaca ttaacaaaga 720 

gactcggggg aaaattccca aactgtttga tgagattaat cctgaaacca aattaattct 780 

tgtggattac atcttgttca aagggaaatg gttgacccca tttgaccctg tcttcaccga 840 

agtcgacact ttccacctgg acaagtacaa gaccattaag gtgcccatga tgtacggtgc 900 

aggcaagttt gcctccacct ttgacaagaa ttttcgttgt catgtcctca aactgcccta 960 

ccaaggaaat gccaccatgc tggtggtcct catggagaaa atgggtgacc acctcgccct 1020 

tgaagactac ctgaccacag acttggtgga gacatggctc agaaacatga aaaccagaaa 1080 

catggaagtt ttctttccga agttcaagct agatcagaag tatgagatgc atgagctgct 1140 

taggcagatg ggaatcagaa gaatcttctc accctttgct gaccttagtg aactctcagc 1200 

tactggaaga aatctccaag tatccagggt tttacaaaga acagtgattg aagttgatga 1260 

aaggggcact gaggcagtgg caggaatctt gtcagaaatt actgcttatt ccatgcctcc 1320 

tgtcatcaaa gtggaccggc catttcattt catgatctat gaagaaacct ctggaatgct 1380 

tctgtttctg ggcagggtgg tgaatccgac tctcctataa ttcaggacat gcataagcaa 1440 

cttcgtgctg tagtagatgc tgaatctgag gtatcaaaca cacacaggat accagcaatg 1500 

gatggcaggg gagagtgttc cttttgttct taactagttt agggtgttct caaataaata 1560 

cagtagtccc cacttatctg agggggatac attcaaagac ccccagcaga tgcctgaaac 1620 

ggtggacagt gctgaacctt atatatattt tttcctacac atacatacct atgataaagt 1680 

ttaatttata aattaggcac agtaagagat taacaataat aacaacatta agtaaaatga 1740 

gttacttgaa cgcaagcact gcaataccat aacagtcaaa ctgattatag agaaggctac 1800 

taagtgactc atgggcgagg agcatagaca gtgtggagac attgggcaag gggagaattc 1860 

acatcctggg tgggacagag caggacgatg caagattcca tcccactact cagaatggca 1920 

tgctgcttaa gacttttaga ttgtttattt ctggaatttt tcatttaatg tttttggacc 1980 

atggttgacc atggttaact gagactgcag aaagcaaaac catggataag ggaggactac 2040 

tacaaaagca ttaaattgat acatattttt taaaaaaaaa 2080 



<210> 29 

<211> 2225 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 114551CB1 






<400> 29 

gtgcttggcg cctgcgctgg acgactcggc cggtagtgga gatgtccggc cggtctaagc 60 
gggagtctcg cggttccact cgcgggaagc gagagtctga gtcgcggggc agctccggtc 120 
gcgtcaagcg ggagcgagat cgggagcggg agcctgaggc ggcgagctcc cggggcagcc 180 
ctgtgcgcgt gaagcgggag ttcgagccgg cgagcgcgcg cgaggccccg gcttctgttg 240 
tcccgtttgt gcgggtgaag cgggagcgcg aggtcgatga ggactcggag cctgagcggg 300 
aggtgcgagc aaagaatggc cgagtggatt ctgaggaccg gaggagccgc caccgcccgt 360 
acctggacac cattaacagg agtgtgctgg actttgactt tgagaaactg tgttctatct 420 
ccctctcaca catcaatgct tatgcctgtc tggtgtgtgg caagtacttt caaggccggg 480 
gtttgaagtc ccacgcctac attcacagtg tccagtttag ccaccatgtt ttcctcaacc 540 
tccacaccct caagtcttac tgccttccag acaactatga gatcatcgat tcctcattgg 600 
aggatatcac gtatgtgttg aagcccactt tcacaaagca gcaaattgca aacttggaca 660 
agcaagccaa attgtcccgg gcatatgatg gtaccactta cctgccgggt attgtgggac 720 
tgaataacat aaaggccaat gattatgcca acgctgtcct tcaggctcta tctaatgttc 780 
ctcctctccg gaactacttt ctggaagaag acaattataa gaacatcaaa cgtcctccag 840 
gggatatcat gttcttgttg gtccagcgtt ttggagagct gatgagaaag ctctggaacc 900 
ctcgaaattt caaggcacat gcgtctcccc atgagatgct tcaggcagtt gtactttgca 960 
gtaagaagac ttttcagatc accaaacaag gagatggcgt tgactttctg tcttggtttc 1020 
tgaatgctct gcactcagct ctggggggca caaagaagaa aaagaagact attgtgactg 1080 
atgttttcca ggggtccatg aggatcttca ctaaaaagct tccccatcct gatctgccag 1140 
cagaagaaaa agagcagttg ctccataatg acgagtacca ggagacaatg gtggagtcca 1200 
cttttatgta cctgacgctg gaccttccta ctgcccccct ctacaaggac gagaaggagc 1260 
agctcatcat tccccaagtg ccactcttca acatcctggc taagttcaat ggcatcactg 1320 
agaaggaata taagacttac aaggagaact ttctgaagcg cttccagctt accaagttgc 1380 
ctccatatct aatcttttgt atcaagagat tcactaagaa caacttcttt gttgagaaga 1440 
atccaactat tgtcaatttc cctattacaa atgtggatct gagagaatac ctgtctgaag 1500 
aagtacaagc agtacacaag aataccacct atgacctcat tgccaacatc gtgcatgacg 1560 
gcaagccctc cgagggctcc taccggatcc acgtgcttca tcatgggaca ggcaaatggt 1620 
atgaattaca agacctccag gtgactgaca tccttcccca gatgatcaca ctgtcagagg 1680 
cttacattca gatttggaag aggcgagata atgatgaaac caaccagcag ggggcttgaa 1740 
ggaggcgtct agggctttgc tcccaagggc tgtggctgat gatggtaaat aagaacacag 1800 
aagctgtagc tgaacacagg ctggctggtg ggcttcctag gccagcccag cttgtatggg 1860 
ttctggctac accagagcac caagagccca cttgcctggg atggccccac actgtcactc 1920 
agctgttctt tgatcatttt tttctagatt gatgctcctt tctcccatgc attgagctcc 1980 
catctagctt cagcagggca gaacccttct ccagatgtgt gtaacttatg tcttgagtat 2040 
ctgggagtag ttgaagaaca gataattcct tccaaacatc aagccttggg attcttggag 2100 
caagcagaaa gccagtaact tcgctctgtt agaggtggag gattttccta tggttccccc 2160 
catttcctga tttgtatttt tagatggatt aaatagtctc ctgtttttaa accaaaaaaa 2220 
aaaaa 2225 



<210> 30 
<211> 3287 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 1261376CB1 



<400> 30 

caggcgcgtt acccggcgtg ctccgcgcgg cgccggcgaa gggacgtggg ggaaggggca 60 
gggaggagga agcggtggct gctgcggatg tcggtgtgag cgagcggcgc ctgaacacac 120 
ggcggctgcc gagcgcctga cccgggcctg cgccagagcc tgcaccgagc tccggggccc 180 
cacacccgct acggtggccc tgcgcccgtt gctactgagg cggcgtgctc tgcattcttc 240 
gctgtccagg cctgccggct ctggtgtctg ctggctcctc cttgctcgcc tgctccctcc 300 
tgcttgcctg agtcaccgcc gccgccgccg ccacagccat ggccgagagt ggtgaaagcg 360 
gcggtcctcc gggctcccag gatagcgccg ccggagccga aggtgctggc gcccccgcgg 420 
ccgctgcctc cgcggacgcc aaaatcatga aagtcaccgt gaagaccccg aaggaaaagg 480 
aggaattcgc cgtgcccgag aatagctccg tccagcagtt taaggaagaa atctctaaac 540 
gttttaaatc acatactgac caacttgtgt tgatatttgc tggaaaaatt ttgaaagatc 600 
aagataccct gagtcagcat ggaattcatg atggacttac tgttcacctt gtcattaaaa 660 
cacaaaacag gcctcaggat cattcagctc agcaaacaaa tacagctgga agcaatgtta 720 
ctacatcatc aactcctaat agtaactcta catctggttc tgctactagc aacccttttg 780 
gtttaggtgg ccttggggga cttgcaggtc tgagtagctt gggtttgaat actaccaact 840 
tctctgaact acagagtcag atgcagcgac aacttttgtc taaccctgaa atgatggtcc 900 
agatcatgga aaatcccttt gttcagagca tgctctcaaa tcctgacctg atgagacagt 960 
taattatggc caatccacaa atgcagcagt tgatacagag aaatccagaa attagtcata 1020 
tgttgaataa tccagatata atgagacaaa cgttggaact tgccaggaat ccagcaatga 1080 
tgcaggagat gatgaggaac caggaccgag ctttgagcaa cctagaaagc atcccagggg 1140 
gatataatgc tttaaggcgc atgtacacag atattcagga accaatgctg agtgctgcac 1200 
aagagcagtt tggtggtaat ccatttgctt ccttggtgag caatacatcc tctggtgaag 1260 
gtagtcaacc ttcccgtaca gaaaatagag atccactacc caatccatgg gctccacaga 1320 
cttcccagag ttcatcagct tccagcggca ctgccagcac tgtgggtggc actactggta 1380 
gtactgccag tggcacttct gggcagagta ctactgcgcc aaatttggtg cctggagtag 1440 
gagctagtat gttcaacaca ccaggaatgc agagcttgtt gcaacaaata actgaaaacc 1500 
cacaacttat gcaaaacatg ttgtctgccc cctacatgag aagcatgatg cagtcactaa 1560 
gccagaatcc tgaccttgct gcacagatga tgctgaataa tcccctattt gctggaaatc 1620 
ctcagcttca agaacaaatg agacaacagc tcccaacttt cctccaacaa atgcagaatc 1680 
ctgatacact atcagcaatg tcaaacccta gagcaatgca ggccttgtta cagattcagc 1740 
agggtttaca gacattagca acggaagccc cgggcctcat cccagggttt actcctggct 1800 
tgggggcatt aggaagcact ggaggctctt caggaactaa tggatctaac gccacaccta 1860 
gtgaaaacac aagtcccaca gcaggaacca ctgaacctgg acatcagcag tttattcagc 1920 
agatgcttca ggctcttgct ggagtaaatc ctcagctaca gaatccagaa gtcagatttc 1980 
agcaacaact ggaacaactc agtgcaatgg gatttttgaa ccgtgaagca aacttgcaag 2040 
ctctaatagc aacaggaggt gatatcaatg cagctattga aaggttactg ggctcccagc 2100 
catcatagca gcatttctgt atcttgaaaa aatgtaattt atttttgata acggctctta 2160 
aactttaaaa tacctgcttt atttcatttt gactcttgga attctgtgct gttataaaca 2220 
aacccaatat gatgcatttt aaggtggagt acagtaagat gtgtgggttt ttctgtattt 2280 
ttcttttctg gaacagtggg aattaaggct actgcatgca tcacttctgc atttattgta 2340 
attttttaaa aacatcacct tttatagttg ggtgaccaga ttttgtcctg catctgtcca 2400 
gtttatttgc tttttaaaca ttagcctatg gtagtaattt atgtagaata aaagcattaa 2460 
aaagaagcaa atcatttgca ctctataatt tgtggtacag tattgcttat tgtgactttg 2520 
gcatgcattt ttgcaaacaa tgctgtaaga tttatactac tgataatttt gttttatttg 2580 
tatacaatat agagtatgca catttgggac tgcatttctg gaaacatact gcaataggct 2640 
ctctgagcaa aacacctgta actaaaaaag tgaagataag aaaatactct taaagctgag 2700 
tatttcctaa ttgtatagaa tcttacagca tctttgacaa acatctccca gcaaaagtgc 2760 
cggttagtca ggtttgttga aaatacagta gaaaagctga ttctggttat ctctttaagg 2820 
acaattaatt gtacagacac ataatgtaac attgtctcaa cattcattca cagattgact 2880 
gtaaattacc ttaatctttg tgcagactga aggaacactg tagtataccc caaagtgcat 2940 
ttgcctagga cttctcagct tctcccatag gtagtttaac aggcattaaa atttgtaatt 3000 
gaaatgttgc tttcactgaa aaagtgtctt gatgtttcag ttatttttaa tcgccataaa 3060 
aaaatagaac tatcttttgg gtttatctgt tttctcatgc acaggcaata cacaaattta 3120 
aaatgagttg tgagccaatt gtttctgaag tgttttggta gttctattaa gaaatagtta 3180 
aatattgtgc ttttcagagc ctcagagaca gggggacgcg gtgccggggt ggggcagcgg 3240 
aatctgtcct ggatggggcc agcttaaata atactggcaa ccaagat 3287 



<210> 31 
<211> 2412 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 1299481CB1



<400> 31 

gggctgtgtg cggcggcggc ggcggcggcc gagggggatg gagcgagcgc cgagccgggt 60
cagagttgaa caatgaccat agttgacaaa gcttctgaat cttcagaccc atcagcctat 120
cagaatcagc ctggcagctc cgaggcagtc tcacctggag acatggatgc aggttctgcc 180
agctggggtg ctgtgtcttc attgaatgat gtgtcaaatc acacactttc tttaggacca 240
gtacctggtg ctgtagttta ttcgagttca tctgtacctg ataaatcaaa accatcacca 300
caaaaggatc aagccctagg tgatggcatc gctcctccac agaaagttct tttcccatct 360
gagaagattt gtcttaagtg gcaacaaact catagagttg gagctgggct ccagaatttg 420
ggcaatacct gttttgccaa tgcagcactg cagtgtttaa cctacacacc acctcttgcc 480
aattacatgc tatcacatga acactccaaa acatgtcatg cagaaggctt ttgtatgatg 540
tgtacaatgc aagcacatat tacccaggca ctcagtaatc ctggggacgt tattaaacca 600
atgtttgtca tcaatgagat gcggcgtata gctaggcacc tccgttttgg aaaccaagaa 660
gatgcccatg aattccttca atacactgtt gatgctatgc agaaagcatg cttgaatggc 720
agcaataaat tagacagaca cacccaggcc accactcttg tttgtcagat atttggagga 780
tacctaagat ctagagtcaa atgtttaaat tgcaagggcg tttcagatac ttttgatcca 840
tatcttgata taacattgga gataaaggct gctcagagtg tcaacaaggc attggagcag 900
tttgtgaagc cggaacagct tgatggagaa aactcgtaca agtgcagcaa gtgtaaaaag 960
atggttccag cttcaaagag gttcactatc catagatcct ctaatgttct tacactttct 1020
ctgaaacgtt ttgcaaattt taccggtgga aaaattgcta aggatgtgaa ataccctgag 1080






tatcttgata ttcggccata tatgtctcaa cccaacggag agccaattgt ctacgtcttg 1140 
tatgcagtgc tggtccacac tggttttaat tgccatgctg gccattactt ctgctacata 1200 
aaagctagca atggcctctg gtatcaaatg aatgactcca ttgtatctac cagtgatatt 1260 
agatcggtac tcagccaaca agcctatgtg ctcttttata tcaggtccca tgatgtgaaa 1320 
aatggaggtg aacttactca tcccacccat agccccggcc agtcctctcc ccgccccgtc 1380 
atcagtcagc gggttgtcac caacaaacag gctgcgccag gctttatcgg accacagctt 1440 
ccctctcaca tgataaagaa tccacctcac ttaaatggga ctggaccatt gaaagacacg 1500 
ccaagcagtt ccatgtcgag tcctaacggg aattccagtg tcaacagggc tagtcctgtt 1560 
aatgcttcag cttctgtcca aaaccggtca gttaataggt cctcagtgat cccagaacat 1620 
cctaagaaac aaaaaattac aatcagtatt cacaacaagt tgcctgttcg ccagtgtcag 1680 
tctcaaccta accttcatag taattctctg gagaacccta ccaagcccgt tccctcttct 1740 
accattacca attctgcagt acagtctacc tcgaacgcat ctacgatgtc agtttccagt 1800 
aaagtaacaa aaccgatccc ccgcagtgaa tcctgctccc agcccgtgat gaatggcaaa 1860 
tccaagctga actccagcgt gctggtgccc tatggcgccg agtcctctga ggactctgac 1920 
gaggagtcaa aggggctggg caaggagaat gggattggta cgattgtgag ctcccactct 1980 
cccggccaag atgccgaaga tgaggaggcc actccgcacg agcttcaaga acccatgacc 2040 
ctaaacggtg ctaacagtgc agacagcgac agtgacccga aagaaaacgg cctagcgcct 2100 
gatggtgcca gctgccaagg ccagcctgcc ctgcactcag aaaatccctt tgctaaggca 2160 
aacggtcttc ctggaaagtt gatgcctgct cctttgctgt ctctcccaga agacaaaatc 2220 
ttagagacct tcaggcttag caacaaactg aaaggctcga cggatgaaat gagtgcacct 2280 
ggagcagaga ggggccctcc cgaggaccgc gacgccgagc ctcagcctgg cagccccgcc 2340 
gccgaatccc tggaggagcc agatgcggcc gcaagcttat tcccctttag tgagggttaa 2400 
ttttagcttg ca 2412 



<210> 32 
<211> 2286 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 1873139CB1 



<400> 32 

gaaaacccgg cgtccggaag cccgtgcttt 
cgtcggccga gcgcccggct agaagcgaca 
ccacctcgcc gggtcctgga gccgcagtcc 
gtccacacct gagctccagc gtctgcattg 
ccccgtcgtc ctggtgctgc agcgttatcg 
gctgggactg gtacagaaag tcagagaaca 
agctgacagg cataagaaaa gaaaactttt 
aaaagtaaat ggaagcacca ctgccatttg 
atgtttcatg aatgccatcc ttcagtcact 
caaagaactg cccgccgtgg agttaaggaa 
caccaggagc caaggggata acaatgtgtc 
tgctttatgg caaggcagcc agactgcatt 
gaagattatg ccaaacttta ggggctatca 
ccttttggac cacctacact tggaacttca 
aattctgcag gagaattcta ctctgtctgc 
tactgttgtc acggctatat tcggaggcat 
tgggacagaa tctagaaagt ttgatccatt 
gttcagaagt aagcgctcta agaatcaaga 
tcttcgcagt tttaccgact tagaagaact 
atgcaaaaag aaacaaaagt ccacaaaaaa 
atgcttacat ttgaaaagat ttcattggac 
cgtagaattt ccactgagag gcctagacat 
tggcccggag agctgcctgt atgacctcgc 
tggttctgga cattacacag catacgcaac 
cagtactgta acactgactg acgaggagac 
ctacgtggaa caccaggcca aagctggatc 
cattcaccaa ccataccaga gaaacatttc 
taatttcatt atgcactttt caatttccta 
cttactgaac atgggcacca actaattttg 
gttttgattt gctgctttag ttgtaataat 
gtcttatttg acttttttat tttatgttaa 
acatcaatgc ttttgttcct ctcacatgct 
gcgacacaac tgcctgctgc ctttccacag 



ctttgacgca agggctcgag acgcagccgc 60 
ccagacggag cctccggagt tcctccgccc 120 
tcccagctgc cctcctcgtg gccatggagt 180 
ctccggactc agccaagttc cccaacggct 240 
ctgtgacgat tttgtggtta atgacaccaa 300
cttacagaac ttggaaaact cagctttcac 360
ggaaaactca acactaaaca gcaagttatt 420 
tgccacaggc cttcggaatt tggggaacac 480 
cagtaacatt gagcagtttt gctgttattt 540 
tgggaaaaca gcaggaaggc ggacatacca 600 
tttggtagaa gagtttagaa agacactctg 660 
tagcccagag tccttatttt atgttgtttg 720 
acagcaggac gcccatgaat tcatgcgcta 780 
gggcggtttc aacggtgttt cccgctcagc 840 
aagtaacaag tgttgcataa atggagcatc 900 
tctccaaaat gaggttaact gcctcatatg 960 
cctagacctt tcattagata ttccaagtca 1020 
aaatggacca gtttgttcgt tacgagattg 1080 
tgatgagaca gagttatata tgtgccataa 1140 
gttttggatt caaaaactac ccaaggtgct 1200 
agcatattta agaaataaag ttgatacata 1260 
gaaatgctac ttactagagc ctgagaacag 1320 
cgctgtggtg gtgcaccatg gttccggggt 1380 
tcacgaaggc cgctggttcc acttcaatga 1440 
tgtggtgaag gcgaaggcct acatcctttt 1500 
ggataaactt taatacctcc tccaaatcat 1560 
cagttttcca caaatacttg atacaagatt 1620 
ttttggattt agttttgtca atggtagtga 1680 
ttgttgttct accagaaaac ctcagcagat 1740 
tcaattttta taggtagttg taagaactta 1800 
cgttttcagt tctcactttg aggcacattt 1860 
gaaagcaaga tgtgttcctt attgtgaaga 1920 
ctataatgga catcaggttg actctaaatc 1980 



aaggatcatg
cattcttttt 
cttgtctctc 
gtacttgtcc 
acttgtttgc 
aaaaaa 






tgtgcacaat 
gcctgtaaaa 
aaggctgctg 
tgcaaataca 
ggggaggtaa 



acttgtggcc 
tataacaaag 
tctttatcag 
agaattactc 
gatgggagta 



cacaaaattt 
ggcatcatta 
cactaactaa 
tctttgttgg 
aagaccaaat 



cacaatgact 
agtagaccag 
ataaatttgt 
tttttctggt 
acatgtaatg 



gctgaggaat 
gtaattactg 
tggttcagtt 
tttggggcat 
tttaaaaaaa 



2040 
2100 
2160 
2220 
2280 
2286 



<210> 33 
<211> 873 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 1903112CB1 



<400> 33 

taaaaacatt tatcatcttt gaaagcactc 
aaatgttctc ccttttttga cagggtcaaa 
aatatctagc ttctcgatat gatttcattg 
cctatggagg atacctctcc ctgatggcat 
ctattgctgg ggccccagtc actctgtgga 
atatgggtca ccctgaccag aatgaacagg 
cagaaaagtt cccctctgaa ccaaatcgtt 
atgtccattt tgcacatacc agtatattac 
atgatttaca gatctatcct caggagagac 
attatgaact gcatcttttg cactaccttc 
taaaagtgat ataattttga cctgtgtaga 
aatgaggagg tttaatcaat agaaaacaca 
tgtaacatct actcctgaaa ataaatgtgg 
taatctaata ccttaacccc acatgctcaa 
agcaatacca taagaattac taaaaaaaaa 



ttacgagact tttattcatc agaatcagtg 60 
tagaaattga cgatcaggtg gaaggactcc 120 
acttagatcg tgtgggcatc cacggctggt 180 
taatgcagag gtcagatatc ttcagggttg 240 
tcttctatga tacaggatac acggaacgtt 300
gctattactt aggatctgtg gccatgcaag 360
tactgctctt acatggtttc ctggatgaga 420 
tgagtttttt agtgagggct ggaaagccat 480 
acagcataag agttcctgaa tcgggagaac 540
aagaaaacct tggatcacgt attgctgctc 600 
actctctggt atacactggc tatttaacca 660 
gaattgatca tcacattttg atacctgcca 720 
tgccatgcag gggtctacgg tttgtggtag 780 
aatcaaatga tacatattcc tgagagaccc 840 



<210> 34 

<211> 1658 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 1993044CB1 



<400> 34 

ggaagagttt cgatgtctct agggtggcta 
ggtgacggcg cccggaggct gtcgggaagt 
gcggcgggtt tgctgagatc tgtggccggc 
agaggtggat cggggcggtg tgtggccagg 
ctcatggaaa gcgaccccgg ggtcttcacc 
gcccaagtag aagaaatatg gagtttagag 
catgggttaa tttttctttt caagtggcag 
caggactccc gacttgacac gatatttttt 
actcaagcca tagtgagtgt gttactgaac 
acattatcag agtttaaaga attttcacaa 
ctgagcaatt cagatgtgat tcgacaagta 
gaatttgata cgaagacatc agcaaaagaa 
cctgttaatg ggagactgta tgaattagat 
gcatgcaatc aagatgattg gttcagtgca 
aagtacagtg aaggtgaaat tcgatttaat 
atatatgagc agaagatagc agagttacaa 
acagatcaag gtaatagtat gttaagtgct 
cttattgaag aagaagtaca gaaattaaaa 
cataattatc tgcctttcat tatggaattg 
ataccactag tagaaaaggc aaaagaaaaa 
tgaagatgtt ttcagatatg tacacatttc 
tatgtataaa gaacttagag caacatccta 
cagcctgtct tgtctttaat gcatggattc 
atgtagtgca tattaaatga aagtgatatt 



gagcgtcctc ccgcgctcag tcgcgctgca 60 
atgcggggtg acgtgtggtt gacgagctcg 120 
ggcagctggt gcggggggca gctgagagcg 180 
gccatgacgg gcaatgccgg ggagtggtgc 240 
gagctcatta aaggattcgg ttgccgagga 300 
cctgagaatt ttgaaaaatt aaagccagtt 360 
ccaggagaag aaccagcagg ctctgtggtt 420 
gctaagcagg taattaataa tgcttgtgct 480 
tgtacccacc aggatgtcca tttaggcgag 540 
agttttgatg cagctatgaa aggcttggca 600 
cacaacagtt tcgccagaca gcaaatgttt 660 
gaagatgctt ttcactttgt cagttatgtt 720 
ggattaagag aaggaccgat tgatttaggt 780 
gtaaggcctg tcatagaaaa aaggatacaa 840 
ttaatggcca ttgtgtctga cagaaaaatg 900 
agacaacttg cagaggagga acccatggat 960 
attcagtcag aagttgccaa aaatcagatg 1020 
agatacaaga ttgagaatat cagaaggaag 1080 
ttaaagactt tagcagaaca ccagcagtta 1140 
cagaacgcaa agaaagctca ggaaaccaaa 1200 
tgcttctgca catattttca tggaaaccat 1260 
attggctcag tgcacgtttg gcaatagtgc 1320 
ataaacttct tccctacctg catcatgtgc 1380 
aagaatgctt tcccaaattc cattatttga 1440 
catttgaggt cctgaccact gtttaagttt tcctgggttg tcctaactaa ccattaatgg 1500

aggcctgaga aaatggcgcc aaacggatta ttgctttaat ctgtaaattt aaaataacgt 1560 

aacaattttg cacattggtc agatccttgg attaaaccgg gttggtccta atttgcttcc 1620 

tccatccata accgggaaaa aaaattccgg gttttccc 1658 



<210> 35 

<211> 1255 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 2292182CB1 



<400> 35 

gcctaacgtc 

gtcgcgttgg 

ggaagcagcg 

ccccgcggca 

tctggcccag 

gtgccagctt 

tgctatgaaa 

taatggaaat 

taatatccat 

tgatcattgt 

ggttcgagct 

ttttggatta 

ggctgttagg 

ttgtttcaat 

tgctcacaga 

tttttatatt 

aaacatataa 

agcatattat 

gaaattgtat 

gtcatttgtt 

gctcttccat 



ttcagcccag 
cgggagggag 
gcgaggtctg 
ggggagcagc 
gggaatcccc 
aggctcctga 
cactcaggtt 
gtcagtggag 
aatcaggccc 
cgtgcccatg 
gctaacctta 
aaacaacacc 
aatatcagca 
gaccatgaac 
gactttgaaa 
acagagcttc 
aatgtaaaca 
cagaaaaagt 
ctttatagct 
tttattacat 
gatgttgata 



ccaggctgcc 
gttacctttc 
cgggaggcat 
tgcagcagca 
agcaagggtt 
agacgctgga 
gtgctgttaa 
gttttgatgc 
atatgaacag 
tcgactggtt 
gtggagactg 
accagacttg 
aagaagtagc 
cttttggaag 
accgtgatcg 
catcatcatg 
atccctacaa 
aactatggca 
aaccatttag 
gtgattattt 
ggtgaccact 



cttcttccct 
ccagtctcgc 
ggcgggagct 
acacgtctct 
cttctccagc 
gacaaatcca 
caaagataga 
ttcaacatct 
agtggtcaca 
caccaacatc 
ctcacttgtc 
tgtgcgagac 
taaaaaggct 
gatcccacat 
gtattattca 
aagaaaaaaa 
tccagataaa 
aaaaatgaaa 
taaataaata 
taaaatttga 
gttggtattt 



gcggagggag 
tctggccgcc 
ccggacgagc 
tgccaggtct 
ttcttcacca 
tatgtcaaac 
cacttttctt 
cagatagttt 
cacgagctta 
agacatttgg 
aatgaaatat 
agagccactc 
gttgatgaag 
aacaagactt 
aatatatgag 
aatttggttc 
acagagaaga 
ctgctttaaa 
cattacattt 
ttatgaaacc 
acattaaatg 



ggcctgggcg 
tgagccagga 
gccggcgggg 
tccccgagcg 
gcaaccagaa 
ttctgcttga 
gcgaagactg 
tgtgccagaa 
ttcatgcatt 
cgtgctcaga 
tcaggttaca 
tttctatcct 
tttttgaatc 
atgcaagata 
cacaatgaca 
tcaagtgaac 
ctgtgattct 
ctccaaggca 
ttgtatttta 
tgtgaagtgt 
taaag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1255 



<210> 36 
<211> 1139 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 2331301CB1 



<400> 36 

tgcctttaaa aaaaaaaaag gcttctaatc ctctggggaa agatctgctt ccttccccat 60
ttgatgcaat gaaataccgt gtgactcttc cccagcgttg agatccatgg agtggtgaga 120
cacatccagt ggacgcccag ggagatggag gtgtacatca ggcacttgga gaaggtgtta 180
aggcgctatg tccagaggct gcagtggctg ctgtccggga gccgccgact gtttggcacc 240
gttttggaga gcaaagtatg catattgctg gacacgtcag ggtccatggg cccctacctg 300
cagcaggtga agacagagct ggttttgctg atttgggaac agctgcggaa gtgctgtgac 360
agttttaacc tgctcagctt tgcagagagc cttcagtcat ggcaggacac gctggtggag 420
accacagatg cagcgtgtca tgaggctatg caatgggtga cccacctgca agctcagggc 480
agcacctcca tcttgcaagc attgctgaaa gctttcagtt tccatgatct ggaaggattg 540
tacctcctga ccgacggaaa gccagacaca agctgcagcc ttgtcctaaa tgaagtccaa 600
aaactcaggg agaaaagaga tgtgaaagtg cacaccattt ccttgaactg ctcagacaga 660
gcggcggttg agttcctgag aaagctggct tccttcaccg gcggacgcta tcactgccct 720
gtgggtgagg acacactctc caaaattcac agcctgctga ccaaaggctt catcaatgaa 780
aaggatcgca cattgccacc atttgaagga gatgatttaa ggatcctggc ccaggaaatc 840
accaaggcca gaagcttcct ctggcaggcc caatccttca gatcccaact ccagaagaaa 900
aatgatgcag aaccaaaggt cactctttcc tagagaagtg ttctcagaaa agtcatcctg 960
caaaggacca ctcactgagc aaatctcagc cccgagggca ggatgggatg aaatgctgtg 1020
acctgcagga aacatgattc ctggtaccag gactctctgg aagctgagga aggaaagact 1080
ttgtcttttg tgtgaggggc cattcccggg tcttaatcta actctccagc cctgtccgg 1139

<210> 37
<211> 1753 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2817512CB1



<400> 37 

gtggcggact agcccaggct 
agcatctcag gttccctctg 
ccagaggccc ccgcatccac 
atttggccca gctcatgaat 
agatcctgga aaaagccatt 
ttgtggaagc aatggtaaat 
atttttgcac acatcttcct 
ccttggaaaa gatccagcct 
agcatcttgc atctatatat 
tgggaattcc tttggaaaca 
acttgaagat tgctaggcta 
taaatcgagc atcgttgctt 
aggtatgcta tgcacgtgtt 
acaatgagct ctcttacaag 
atgctttgca ctgtacgatc 
ctctttttaa ggatgaaagg 
atctagatag gatcatcaga 
accaaaaagc aactacagct 
atttgttgtc tgcaagcaaa 
tagagatccc tgcagctaag 
tgaatggatt tattgaccag 
caacgtggga taagcagatc 
ttagtcaaac agcaccagaa 
tccttgcaga acttctgtgc 
atctccaaag catttgcatc 
tttaaagaag acattattag 
atttcagggt ctctgtgtat 
gaggtgaaat atctgtggct 
acttaagtgc ttcatgcctc 
aaaaaaaaaa aaa 



cccacacccg acgctctggc 
gctgcactct ggaggaccac 
cgctgagctg ggagaaagat 
tcgagcggct ctcataaaga 
cagttatctg gagcagaaca 
gagaatgtca gtctcgtgat 
aacttgcctg atagcacagc 
agagtcattt catttgagga 
gagaaagaag aagattggag 
ggacaaaaac agtacaatgt 
tatctggagg atgatgatcc 
cagaatgaat caaccaatga 
cttgattata gaagaaaatt 
acaatagtcc acgaaagtga 
ttagcatcag cagggcagca 
tgccagcaac ttgctgccta 
ggaaatcaac ttcaagaatt 
gatggttcca gcatcttgga 
ttatataata atattacctt 
gcggaaaaga tagcatctca 
attgatggaa tagttcattt 
caatcacttt gtttccaagt 
tggacagcac aagccatgga 
acatgacatc tttttccatg 
atgaccttat acatttcaat 
agcaggaagt acaagcattt 
taagctaact cagatgtttt 
aaaaagtttg agatttgtga 
tccaaatgtg gttattctaa 



ccacacagac gctactctgt 60 
actcgttttc tttttggctg 120 
ggcggcagcc gtgcgacagg 180 
tctggctggc aagtatcgtc 240 
actagaagct ttgaaagctt 300 
ctcgcggcag ttgctgactg 360 
caaagaaatc tatcacttca 420 
gcaggttgct tccataagac 480 
aaatgcagcc caagtgttgg 540 
agattataaa ctggagactt 600 
agtccaggca gaggcttaca 660 
acaattacag atacattata 720 
cattgaagct gcacaaaggt 780 
aagactagag gccttaaaac 840 
gcgttctcgg atgctagcta 900 
tgggatccta gagaaaatgt 960 
tgctgccatg ctgatgcctc 1020 
cagagctgtt attgaacaca 1080 
cgaagaactt ggagctcttt 1140 
aatgataacc gaaggacgta 1200 
tgaaacacga gaagccctgc 1260 
gaataacctt ttggagaaaa 1320 
agcccagatg gctcagtgaa 1380
ttgtgcagat cagtttcact 1440 
cccttttatg ctggattccg 1500 
aaaatatgta gttcccatat 1560 
gaaagctttt tctttaaaca 1620 
taactttgta gtcatgtaaa 1680 
taaatggaga aatgagccaa 1740

1753 



<210> 38 

<211> 580 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 3489039CB1 



<400> 38 

cgaggcgccc gaagccgtcg cggcggggac catgttgctt ccgaacatcc tgctcaccgg 60
tacaccaggg gttggaaaaa ccacactagg caaagaactt gcgtcaaaat caggactgaa 120
atacattaat gtgggtgatt tagctcgaga agagcaattg tatgatggct atgatgaaga 180
gtatgactgt cccattttag atgaagacag agtagttgat gagttagata accaaatgag 240
agaaggtgga gttattgttg attaccatgg ttgtgatttc ttccctgaac gctggtttca 300
tatagttttt gtgctgagaa cagataccaa tgtattgtac gaaagacttg aaacaagggg 360
ttataatgag aagaaactaa cagacaatat tcagtgtgag atttttcaag ttctttatga 420
agaagccaca gcatcctaca aggaagaaat cgtgcatcag ctgcccagta ataaaccaga 480
agagctagaa aataatgtag atcagatctt gaaatggatt gagcagtgga tcaaagatca 540
taactcttga cttataaggc tagctactta ataatcactc 580



<210> 39 

<211> 2186 

<212> DNA 

<213> Homo sapiens 






<220> 

<221> misc_feature

<223> Incyte ID No: 5432879CB1 



<400> 39 

gtccggcggc 

atgctgagct 

ctgcggaccg 

attgctgctg 

aagcgtagaa 

tccctgctac 

tcccagtact 

ttgcaccttc 

agctgcttgt 

gatgctcacg 

cctcgggtca 

aaacaaatta 

caacatcctt 

agtcctgttc 

ggtcacccat 

gatgttgtgt 

gtggaacacc 

ctctgcatcc 

gagcacgtgc 

cataaaccta 

caggatgggc 

atttttatga 

ttccctctcc 

gttgtcgtcc 

ccttctgcca 

gtccgcaagg 

cgcgtccttt 

cctcctgcaa 

tactgttgcg 

gcaggctcca 

gttcatgtgt 

ttttactcat 

attgtaatgt 

gatcttttat 

taaaggcaac 

cattgtgaaa 

ctaaagaaca 



ggcggcggcg 
cccgggccga 
gggcggccgt 
ctcttgcagc 
aagggcttgt 
aaggcctgtc 
ccagggatca 
tgaaagcctt 
tggatgtctt 
aattattcca 
cacatttgtt 
cctgccgcac 
ttcatggaag 
gatttgatac 
tgaccctgga 
gtgacaactg 
agaggaccac 
acctacagcg 
agttcaatga 
gtcaacacaa 
cgggagcccc 
atggcgcctg 
cagttgttcc 
accatggaga 
ggaaccctct 
ccagcctgca 
ccaggatgca 
ggctagagct 
tgtgcaagcg 
cctgggagcc 
gtaggtggtt 
ctgatacagg 
taggtgttct 
ttttaataag 
aatttagaag 
tattccttcc 
ccttcctttt 



gtagcggagg 
ggcggcgatg 
cagatataaa 
aggaatatat 
gcctggcctt 
tgcctgtcct 
gaaggagccc 
gtcctgccaa 
aagaatgtac 
tgtcattacc 
tgatgtgcat 
aagagggtca 
actcactagt 
ctttgatagc 
ccactgcctt 
tacaaagatt 
ttttgttaaa 
gctgagctgg 
gttcctgatg 
ccctaaactg 
cacaccagtt 
ctccccatct 
cgactacagc 
catgcactct 
ctcaactagc 
ggaggtcctg 
gcaccagagc 
gatggcactg 
gccccactag 
agccccagtt 
ctgttgtgtt 
taattaaaag 
cagaggggag 
caggcccata 
aaaaagtgcc 
aggctactca 
tctatg 



agacggtttc
accgcggccg 
gtcatgaaga 
gttatttggg 
gttaatttag 
gctttcatca 
ccctcacacc 
gaagttactg 
agatggcaga 
tcgtcattgg 
tccctggagc 
cctcacccca
aatatggtct 
ctttcactaa 
caccacttca 
gaagccaagg 
cagttaaaac 
tccagccacg 
atggacattt 
aacaagaacc 
ctgaatcagc 
ttattgccaa 
tcctccacat 
ggacactttg 
aatcagtggc 
tcctccagcg 
caggagtgca 
tctgcactgt 
agccttccag 
cacaccaaac 
aagaaagcat 
aactcagatt 
gtacctttgt 
aaaattgttg 
tttcactttc 
aaggatagca 



aggcctccgg 
acagggccat 
actggggagt 
gtcccattac 
ggaacacctg 
ggtggctgga 
agtatttatc 
atgatgaggt 
tctcatcatt 
aagatgagcg 
agcagtcaga 
catccaatca 
gcaaacactg 
gtattccagc 
tctcatcaga 
gaacgttgaa 
tagggaagct 
gcacgcctct 
acaagtacca 
cagggcctac 
caggggcccc 
cgctgtcagc 
acctcttccg 
tcacttaccg 
tgtgggtctc 
cctacctgct 
agtctgaaga 
ccaggaaaaa 
ccttctggtg 
caggctccct 
tcattatgtc 
cttgaagcca 
ctaatcaacg 
acaagaatta 
gattgctttt 
agagaacagg 



tgcggctgca 
ccagcgcctc 
tataggtgga 
agaaagaaag 
cttcatgaac 
agagttcacc 
cttaacactc 
cttagatgca 
tgaagaacag 
agaccgccag 
aataactccc 
ctggaagtct 
tgaacaccag 
cgccacatgg 
atcagtgcgg 
cggggaaaag 
ccctcagtgt 
gaagcggcat 
cctccttgga 
actggagctg 
caaaacacag 
gccgatgccc 
gctgatggca 
acggtcccca 
cgatgacact 
gttctacgag 
atgactgtgc 
agtaaaactg 
tgttctaaga 
gaacagtcct 
cggagtgtct 
ccgttttcat 
tttccactta 
atgaaattat 
gtagcacgtc 
taaatgatgc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2186 



<210> 40 

<211> 2522 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 5853753CB1 



<400> 40 

ctccgggcgt gctgccggcg gcggtaggtg gcgcgcgggt ccggcgggcg gttggcttga 60
gcgggaccgg agctgaggca ggaagagccg gcgccatggt ggagaaggag gaggctggcg 120
gcggcattag cgaggaggag gcggcacagt atgaccggca gatccgcctg tggggactgg 180
aggcccagaa acggctgcgg gcctctcggg tgcttcttgt cggcttgaaa ggacttgggg 240
ctgaaattgc caagaatctc atcttggcag gagtgaaagg actgaccatg ctggatcacg 300
aacaggtaac tccagaagat cccggagctc agttcttgat tcgtactggg tctgttggcc 360
gaaatagggc tgaagcctct ttggagcgag ctcagaatct caaccccatg gtggatgtga 420
aggtggacac tgaggatata gagaagaaac cagagtcatt tttcactcaa ttcgatgctg 480
tgtgtctgac ttgctgctcc agggatgtca tagttaaagt tgaccagatc tgtcacaaaa 540
atagcatcaa gttctttaca ggagatgttt ttggctacca tggatacaca tttgccaatc 600
taggagagca tgagtttgta gaggagaaaa ctaaagttgc caaagttagc caaggagtag 660
aagatgggcc cgacaccaag agagcaaaac ttgattcttc tgagacaacg atggtcaaaa 720
agaaggtggt cttctgccct gttaaagaag ccctggaggt ggactggagc agtgagaaag 780
caaaggctgc tctgaagcgc acgacctccg actactttct ccttcaagtg ctcttaaagt 840

tccgtacaga taaaggaaga gatcccagtt ctgatacata tgaggaagat tctgagttgt 900
tgctccagat acgaaatgat gtgcttgact cactgggtat tagtcctgac ctgcttcctg 960
aggactttgt caggtactgc ttctccgaga tggccccagt gtgtgcggtg gttggaggga 1020
ttttggcaca ggaaattgtg aaggccctgt ctcagcggga ccctcctcac aacaacttct 1080
tcttcttcga tggcatgaag gggaatggga ttgtggagtg ccttggcccc aagtgaactc 1140
aagatttggc agccccagag atgccaactg cagcatgccc acctgtattc cctgtcccct 1200
tccttcatga aggcatctcc aggcaaggaa aactgaagtc attggcccga tacaaaacat 1260
ttcctgcaac gaaggaggtg gtgccgacgt gctgctcccc atcaccagca gctgctcgac 1320
aaggggcgca gggtggctgt ctttgttcca gcactgttca ggctgcctgt catcccgggc 1380
ctgccagctc ccctgagtga tgagcacttc caagcacccc tctgcccttt ctctgtcctt 1440
atgctgtccc ggcctcgcca gccctctggg gcattgtggg agatgcctgc caggaatgag 1500
caagctctgt tgctcgggag cctcttgtca ccttcttgga cttattcccc acctgatacc 1560
ttatagagaa aagtgtgaat tcaggtggag agtaggccca ggccccatga ggcaccagtg 1620
gaagcacagc tccaagttca gacaggtgcc cttagagagg aaaaccatga caggcaaatg 1680
catttcctct ggagtttgag accctgacaa acaacaggtg gcatctggtg tgctgttctt 1740
gagttttcgt ttaggattag ttgagttcca gctgggtttt gggagaaagg agatgctacc 1800
aagtcttgga tgttagggcg agaccctgca agttgagtat tagagagctt gtctttcaag 1860
gcaggttcct ggggcttcag ggctaggagg gaggagcctg cccttttaac agaaccccag 1920
tcacatgcgg ctcaagtcac tcagaggctg ttgcatttca gggctatgtt ggtcctttgt 1980
ttacctccta aaccacagct gtttgtgttt cacatatgtt gtgaattttc cttggttctt 2040
tttaaaggaa tgataataaa gttacttgct ttaggatttg cttgtttttc ttccacttca 2100
gaagcttctg agagggaatg ggatgatcct accagttgcc ttttcagacc tgaggctcta 2160
actcaagaga ttcctcctct ccctcaccat tcctgccacc attttttctg ggtgatgcag 2220
caagagttaa attgttcaca ttctagaatg tgtagaagct cctggctcca gttgtctaat 2280
ggataattca gctaaatagt ggctattttc agtggcaaga attataataa taaagggaag 2340
tcaaaagtga tgcttcatat gccccaagtg ccacagccgc tacgggtcag agcttttatt 2400
ggaagaccag gtttgccttg atgcagccaa agtgcgttcc agttcacccc actcttgctt 2460
gtgtttcttc tttatgtaaa ttgtgtgatg tttcaataaa tcagtctctt tcctaaaaaa 2520
aa 2522



<210> 41 

<211> 2167 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 411344CB1 



<400> 41 

cgccggtctt ggggggatgg ttccatcatg 
ctgttggctt tgcaaaatga cccacctcct 
aattcaatta cacagtggat tgtagacatg 
gaaaaatttc aacttctatt taaatttagt 
atgtttactg gtgaaaatat tcctgttcat 
ttatccattc taacagaaga ctggtcccca 
attattagca tgctttccag ctgcaaggaa 
gcgcgaacat gtaacaagaa tccaaagaaa 
tgatgccact gttatcatcc tcctagcaga 
gatcattcag tctttgaact ttaacctttg 
acttcctttt actgcatttt tactcgtgtg 
gtccaggcaa ctgacatgct tttattagtc 
tcaaatataa ttccattttt tatttttatt 
tcatgtattg tgcaataaca atgacttcct 
tgggcgttgt aacaggaaaa gttttcatta 
agggcctatg aaatgaactt actggttata 
cattacttta aaagtcaccc caactgttta 
gttaacagca tgaattaaca tgcgtcttta 
tattgaattg tttatatcta ctgtcaagtt 
tggcagagag gtacagtatg ttatccctat 
ttcctcaatc tctgaagtaa aggtatgagt 
tttattttgg aagtaagaaa aatggcagtg 
gcttgttatt tgtacattaa agattttttc 
tagccatcag catgagccct actgcctaaa 
ccgtaaacat ttttgtttgc aatcttgttt 
aatactttta ttgaaacttt tgttaagttt 
tcatcttgtc ctttaatcct gtaccctaaa 



gcgtcaatgc agaaacgact acagaaagaa 60 
ggaatgacct taaatgagaa gagtgttcaa 120 
gaaggtgcac caggtacctt atatgaaggg 180 
agtcgatatc cttttgactc tcctcaggtc 240 
cctcatgttt atagcaatgg tcatatctgt 300 
gcgctctcag tccaatcagt ttgtcttagc 360 
aagagacgac caccggataa ttctttttat 420 
acaaaatggt ggtatcatga tgatacttgt 480 
agatagtcct actgagaaaa tgagcacttt 540 
actggaagtg acctataggc aatgaagact 600 
cattctgggc gcatgttgat cgctggttca 660 
atacagtatt aatgcaggtg tcaggaaatg 720 
tttttaagct tttggaaaag ctccaggtcc 780 
tggcggtttt ggtacgttca ttgccggcaa 840 
actcctgcca ttcaatgatt aatgcatgat 900 
gtgggaatat aaataaagtg agggatccaa 960
tatttggatt ctatgcactg tgatcctaag 1020 
aaggactgta atgaaagatc attgcatatt 1080 
gttttgacat ggaagatttt caagtaacat 1140 
ggtgaaaata aattaatttg ttgtatatag 1200 
aatatagggt atgaatggtt taatcaaggc 1260 
atgattaaat tgctgcagtc cataatttgg 1320 
caagtaagtt acactctgtt aacttcctgc 1380 
cactatttca tttatttatg tttggaaacc 1440 
cttttgttat aagtcaagtt tgaatgttac 1500 
tttcttgtaa attttcttta cttgtgagta 1560 
ataagaaata catttttgac agaggcttaa 1620 
tgctttaaca
atggttgctt 
ttgttgaaaa 
aaaggtccag 
agtgattaat 
tgttattttt 
tgaatcaatt 
ccagtaaaac 
atagatgaca 
caaggta 



aaagagtgtg 
atttgtctca 
gactttgctt 
ttctcagtac 
gtattttata 
actaatgcct 
tgggcttaaa 
cattcaatcc 
ttttgtagct 



gacattttta 
cacagccata 
acagctagat 
catgtgagtt 
aattaccttt 
tattgtggca 
actgaaagcc 
ataattggta 
ttggacatgg 



ttttaaaatt 
tagtttttcc 
gaaactttct 
aacgatacta 
tcacatatgc 
ctctttttga 
agttgggctg 
aataatattt 
tggtaattaa 



<210> 42 

<211> 1826 

<212> DNA 

<213> Homo sapiens 



taggcaaaag 
tggagggttt 
atagaaaaaa 
caactaagtt 
aaaatctgtt 
aacatcctgc 
aaaggtttga 
taaaaatggt 
ggggcattaa 



tcacta 
tgtttt 
aaaatt 
cttttt 
tctact 
agtgaa 
aatacg 
tttaat 
ttttac 



tcaa 
gttg 
gttg 
aaaa 
acaa 
tata 
tacc 
ccgt 
actc 



1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2167 



<220> 

<221> misc_feature

<223> Incyte ID No: 1256390CB1 



<220> 

<221> unsure
<222> 1755
<223> a, t, c, g, or other



<400> 42 

cccccgcctt 

tattaaatca 

tcatttgtta 

cacagccagt 

ttaaatgata 

aacattttaa 

gcatgtttat 

ctcattgcac 

gtagagtgct 

ggagggctgg 

gtgagtccat 

tatgagtgga 

ggagtgacct 

cccaagaaaa 

ttttgccaat 

aagctcctta 

tacacttgga 

tgttttggga 

agaaactgcc 

gctgttgcca 

tcttatgggc 

ccagaaatga 

aattatagag 

tgggcccgag 

tatgggtttg 

gtgctgtcag 

gtgacatctg 

cattctgccc 

tttctttgat 

ttttctctcc 

agtgagatag 



tctatagcat 
tgcatttttc 
gaagaacaac 
tagtatccag 
tatagaaaag 
aggtaaccac 
taatgaagca 
taaatgtgtc 
tgcttaagcc 
gatatgatag 
ggagcctgga 
tgagagagat 
atgagaccca 
tcatttggat 
ggttcgtcaa 
ggaacctgga 
caactgatcg 
cggatctcaa 
aagatcaaac 
gcttcataga 
agttaattct 
ttcaagttgg 
ttggatcgag 
acattgggat 
ttctgccaga 
tcctggatga 
ccactatgct 
aggctttttt 
actaattctg 
ttctntaact 
gatattacat 



agttctcttt 
aagatggaat 
ctacatgcta 
ttggtgctgg 
gtaaattttt 
ataaaaatgg 
gtacttctga 
ttccttggta 
tctgcttgaa 
atccttagcc 
gacgtattcc 
cagtgagaag 
ccccatgtat 
ggactgtgga 
agaaattcta 
cttctatgtc 
tctttggagg 
tcgaaatttc 
attctgtggg 
gagcaagaag 
cacaccttac 
acagaaggca 
tgcagatatt 
tcccttctca 
agctcagatc 
tgtgtatgcg 
gctgggcctg 
ctgccagaaa 
acattacctg 
gtttgtacac 
gccctg 



aggtggaatg 
caatctttga 
gagagagagg 
actccagcca 
aaagaaatat 
ttaatttttc 
ttagtatatg 
tagtggagga 
accctttatc 
caacacagac 
tataacatat 
tacaaggaag 
tatctgaaga 
attcacgcca 
caaaaccata 
cttccagttc 
aaatcccgtt 
aatgcatctt 
acagggccag 
gatgatattt 
ggctacacca 
gcaaatgcat 
ttatatgcct 
tatacgtttg 
cagcccacct 
aaacactggc 
ctggtgtcct 
ttccccttta 
tggaattttt 
actttaatgt 



attcctataa 
tttaatctaa 
aggaaatata 
ggtgtcctgc 
ttattaatat 
cattccaaag 
acattctgaa 
tttgaggatt 
ttttggggat 
aagagattgt 
accaccccat 
tggtgacaca 
tcagccaacc 
gagaatggat 
aagacaactc 
ttaacataga 
caccccataa 
ggtgtagtat 
tgtctgaacc 
tgtgcttcct 
aaaataaatc 
tgaaagcaaa 
catcagggtc 
agctgaggga 
gtgaggagac 
actcggacag 
gcatgtctct 
tttttcctcc 
gctgacttcc 
aataatttac 



gatttctcat 
gctgatattc 
cccacgacca 
ctcatggtag 
attcctataa 
taaatgctaa 
gttaattaaa 
ggaatataga 
gctggttcct 
ggacaagtca 
gggagagatc 
gcatttccta 
atctggtaat 
tgctcctgct 
aagtatacgc 
tggttatatc 
taatggcaca 
tggtgcctct 
agagactaaa 
gaccatgcac 
aagtaaccac 
gtatggaacc 
ttcaagagat 
cagtggaaca 
catggaggct 
tgctggaagg 
tctctaagtg 
tagtaaagtc 
acagattttt 
agacttgtgt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1826 



<210> 43 
<211> 1371 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<223> Incyte ID No: 1786774CB1

<400> 43

gcaaaaattg 

ctcatcgtca

agagcctcaa 

taaggctcca 

caccaccccc 

ggaagccttg 

ggccaactct

cagtgaaaag 

ctatcaaaag 

tatttgagtg 

ctaggatgta 

tagtaacttg 

gctgtgtaac 

cctttaattc 

acctccaccc 

ataggaagga 

gataggagtc 

atagggagaa 

ttcctccaac 

acccttccat 

cttgacagcc 

aggattcaca 

aatcgcttga 



tttcaaaaaa 
cctcaacctg 
aaagaaattt 
gagccaacgt 
aggccactgc 
aagcctggaa 
tgtgcccgat 
gaatgtcaag 
catagaactc 
tatactgagt 
catattatta 
tcaactgtca 
ttcaatacac 
ctgttgtatg 
actcacctcc 
cttcggtgtg 
gggatggtga 
accaatctga 
atgtttggta 
taaaatatgt 
aggaaccact 
ctgaaaagtg 
ccctgggaag 



tttgtgagga 
gatttgggat 
ctgagtcatt 
gggctgatga 
tcagcacccc 
actgtggtga 
tctggttcag 
aaacctgcat 
cctaatttcc 
atttacaact 
accacttata 
cccaactaaa 
agactctatc 
tactatgtca 
ctaacctcct 
agaattagaa 
gggtagttag 
attacaatta 
acaagtgctt 
cagtgggccc 
tgaagctggc 
agccaccaaa 
cagaggttgc 



ttttgattcc 
gtcaggggaa 
gagtgtcacc 
tctgcctgcc 
tgtggatggg 
atatgtggtt 
tggctgtaat 
tcaaggatga 
acatattcac 
tatacatgta 
taggtaagaa 
aagtttcaga 
tacttcacaa 
gctcaaaccc 
tatgtccctc 
atgatgtaaa 
atttttgcct 
cttaaatgta 
aatgtatgtt 
ttccatttta 
atccactctt 
ttgaaatgcg 
agagagccgg 



tatctcgttc 
gaactcagtg 
agagaccagg 
actacctcat 
gcagaggatc 
cgatggtatt 
ggctcaggaa 
gcaagtaaat 
ccaatacaaa 
attgaattct 
agctgaggct 
gctgaggatt 
cctgcaatgt 
ctacccctgt 
acagtagcaa 
tgtttacgca 
cacttgccct 
tcacagactg 
aaaataaaga 
tggagtggaa 
gaacagagag 
agtaatggag 
gattgcccca 



aaatttctgg 
aatctactcc 
atgaagatga 
ctgaggccac 
ctagatgttt 
atgacaaaca 
atagattcaa 
tggcctgtct 
tacagcacta 
cactacagcc 
ctgagaagtt 
tagacttaga 
gattctgatt 
ccctgcccat 
gatgtaggtg 
ggagtgctgg 
gaaagtggta 
tcactttgta 
aggtttttat 
tgggaaggcc 
tattaaagac 
agtaacggag 

g 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1371 



<210> 44 
<211> 3634 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 1911808CB1 



<400> 44 

gcggcggcag tggcgcgcac aggtgattga 
ctccttgctg gcaggtggcg aagcccattg 
cggcggtctg gctcgggagg cgttcctggg 
gagaaggcgg cctggcgctg ggcggagacg 
atcgagaccg cttaccgcat ctggctggag 
tgcaaaggaa atccgaattg cttggttggt 
gatgaaaata gttttcataa catcgatgat 
tttgtgggcc tgactaacct tggagccact 
tttctcaact tggagcttcg gcaggcactc 
atgctgggag acggcatcca agaagaaaaa 
ctccagtact tgtttgcctt gttgcaaaac 
tttgttaaag ccttgggcct ggacactgga 
ctctttatgt ctctattgga agatactttg 
attgttcaac agcagttctg tggagaatat 
agagagtcta agcttttgtc aaaattttat 
cagttaacag attgtatctc ggaatttttg 
tatttttgcg agaactgtca aagcaaacag 
cttccttgca ctctgaactt gcagctaatg 
aagaaaaagc tgaataccta cattggcttc 
gaacataaag gtgggtccta cgtgtatgaa 
agtgcttatt ctggccacta catcgcccac 
aagtttaatg atgaagacat agaaaagatg 
gaagatctag cagaaccttc taagtctcag 
cattgctctc gaaatgcata tatgttggtt 
actactgttc aagttccagc ctttcttcaa 
gaggagtggt gtattgaaat ggctgagatg 
aaacacgaag aggttaagga gctgtaccaa 
tttgtctctc tggaatggct gcaaaagtgg 
gataatcacg cttgcctgtg ttcccatgac 
aagaggatat ctgaatatgc agctgacatt 
ctaactgtga aagccctgtg taaggaatgt 
aagaaccaac taaatgaaga ttataaaact 



ctggccagct gcctgaagga gcgccaggtc 60 
gggcggcggt gcagaccgcg gcggcggctg 120 
gccaaggcca tggccccgcg gctgcagctg 180 
gtgcggcccg aggaggtgtc gcaggagcat 240 
ccctgcattc gcggcgtgtg cagacgaaac 300
attggtgagc atatttggtt aggagaaata 360
cccaactgtg agaggagaaa aaagaactca 420 
tgttatgtca acacatttct tcaagtgtgg 480 
tacttatgtc caagcacttg tagtgactac 540 
gattatgagc ctcaaacaat ttgtgagcat 600 
agtaataggc gatacattga tccatcagga 660 
caacagcagg atgctcaaga attttcaaag 720 
tctaaccaaa agaatccaga tgtgcgcaat 780 
gcctatgtaa ctgtttgcaa ccagtgtggc 840 
gagctggagt taaatatcca aggccacaaa 900 
aaggaagaaa aattagaagg agacaatcgc 960 
aatgcaacaa gaaagattcg acttcttagc 1020 
cgttttgtct ttgacaggca aactggacat 1080 
tcagaaattt tggatatgga gccttatgtg 1140 
ctcagcgcag tcctcataca cagaggagtg 1200 
gtgaaagatc cacagtctgg tgaatggtat 1260 
gaggggaaga aattacaact agggattgag 1320 
acacgtaaac ccaagtgtgg caaaggaact 1380 
tatagactgc aaactcaaga aaagcccaac 1440 
gagctggtag atcgggataa ttccaaattt 1500 
cgtaagcaaa gtgtggataa aggaaaagca 1560 
aggttacctg ctggagctga gccctatgag 1620 
ttggatgaat caacacctac caaacctatt 1680 
aagcttcacc cggataaaat atcaattatg 1740 
ttctatagta gatatggagg aggtccaaga 1800 
gtagtagaac gttgtcgcat attgcgtctg 1860 
gttaataatc tgctgaaagc agcagtaaag 1920 
ggcagcgatg
gaacagctgg 
accttaaata 
gaagatattc 
tctaaagagg 
tacaaagagt 
ttacataaga 
aacagaccgt 
ttctttgtag 
tcagttggga 
atgaccaaag 
caaaagctct 
aacccttcag 
ttattgtgtc 
aaagttgtgg 
tctgaaacag 
aatcaaatca 
aagattttaa 
attttattga 
gtttgtatgc 
acttgctgac 
aagcaaaggt 
ttcaaacgtg 
cagacttgaa 
aagtaaagac 
atagtcgtgg 
gcgttggtcc 
gtaattttag 
ataaagcatg 



gattttgggt 
atgagcaaga 
aagatgaatc 
tgtgtccaca 
cttggagcaa 
gctgttcaca 
tgattgcaaa 
gtctcagcaa 
aagagtggcg 
acagtgctct 
aagattctaa 
ttgttgtgga 
aaacacagta 
agcagcagag 
ataataaaaa 
aggaggacaa 
tgcatgcatt 
gtgatgactg 
aggctgatga 
cagaagaagg 
tgctaagaaa 
ggatttaaga 
actgtcccaa 
ggaaatgttt 
ggaacaagtt 
gaggggaata 
ataaaggatt 
agacataacc 
aaagtgaaga 



ggggaagtcc 
tggtgatgca 
aaaggaagaa 
tggtgagtta 
actgcagcag 
gtgcaagatt 
cgagcaaaag 
ctggccagag 
gaaatttgtt 
tttgtgtccc 
acttatagct 
tcatgtaatt 
tatttctgag 
ggacctgcgt 
ggtgatgaag 
ggaagaagct 
ttcagttgct 
tgccacccta 
accaattgca 
gtttaaaggt 
tgaccagagg 
attaaaccat 
atgccttatg 
ccaattttat 
tagtagcagt 
aagttctgtt 
gtatcaagta 
tgcttaccaa 
acttgtaaaa 



tccttgcgga 
gaacaaagca 
agaaaagaag 
tgcatatctg 
tactttccaa 
ttagaaagag 
acttctctcc 
gatacggatg 
agaaagccta 
cacgggggcc 
ctcatatggc 
aaaatcacga 
cccaaactct 
gaatacactc 
gattcggctc 
aaaccagatg 
ccttttgacc 
ggcacccttg 
gattatgctg 
actggtcttc 
ggaagaggag 
tacatgcccc 
tcaaataaag 
atttaagggg 
aatagtaaat 
atatttcctt 
gatgggacaa 
tgcctgtctt 
aaaa 



gttggcgcca 
acggaaagat 
aggaggaatt 
aaaatgaaag 
aggctcctga 
aaggggaaga 
caaatctgtt 
tcctctacac
caagatgcag 
tcatgtttac 
ccagtgagtg 
gaattgaagt 
gtccagaatg 
aagccaccat 
cggaactgaa 
gagaaaaaga 
agaatttgtc 
gcgtcattcc 
caatggatga 
ttggacatta 
tttgacatgt 
ttccaaaagg 
cagattgcac 
ggtggtgggt 
catgtttaca 
gctcgagttt 
cattctgctc 
tgattcatat 



gctagctctt 
gaacggtagc 
aaattttaat 
aaggcttgtt 
gtttccaagt 
aaatgaagcc 
ccaggataaa 
cgtgtctcag
ccctgtgtca 
atttgcttcc 
gcaaatgata 
gggagatgta 
cagagaaggc 
ctatgtccat 
tgtgagtagt 
tccagatttt 
aattgatgga 
tgaatctgtc 
tgtcatgcaa 
atctttgaat 
tagggcatta 
cagaaatcca 
tgatggacat 
gggagggggc 
tatgagattt 
cataccagat 
tgaacgaaaa 
tctactttca 



1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3634 



<210> 45 
<211> 1661 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 1973875CB1 



<400> 45 

ctcgagccga ttcggctcga ggcggactgg 
tggcccggcc acctgggacc gtgctgggga 
caaagttttg gcggacccat cgctggggct 
gcccgatgcc gggcggccgg agccattgac 
ccaccccgcc gcctccggtg catggggact 
gggagacagc gccgggagag gccggcagcc 
aatgagcccc tgaagaaaga gcggcttaag 
cagctgcgga gcaaacggga tgagttctgg 
gagatctggg atgccctcaa ggctgccgcc 
gcccaggcca tcctggatgg agccagcatc 
tacgatgagc tgggcaatcg ctaccagctg 
ctgctgctgg agcacacgga ggaggagagc 
cgccgtgagt tcccgctgaa ggtgcgcctg 
agcctgcccg acacagtggg gcagctcaag 
ccatcgtggc agcggtggtt cttctccggg 
gagaccaaga tccagaaaga ttttgtcatc 
caggactgat gggcccacgg acccctggga 
ccctgctgct gccttccagt gctgtcattt 
gtgggtgagc cgtgaaggga ccctgccttt 
cccagggagc aggcagccac acacgggcct 
gccctcaccc tgcctgtctc ccgaagcagg 
ctggagccca gatctgtcat ctggtgctgc 
ggtctgaagc agctgctgtc tccctcctct 
agtgccactc ccttggaagg gagggaacca 
ggcgggcctc tgcctctccc tgcccccagc 
cggctgggct gtcgtggcag ggtctgcaca 
gggaggtctg cagcctggtc atggcctcca 



tctccggccg ggcaccgtcg cgggcccccc 60 
gtctgccact tccctctctc ccctggcccg 120 
gagcgcgccc ccggggggag atcggggagc 180 
ccgggacgcc gccgtccgct gagcagccga 240 
ggctgaggag ccagcatggg caactgcgtg 300
ccgggacacc cccgcaagcg agcaggacgc 360
tggaagagcg actaccccat gactgacggg 420
gacacagcgc ctgccttcga gggccgcaag 480 
tatgctgctg aagccaacga ccacgagctg 540 
accctgcctc atggcaccct ctgtgaatgc 600 
cccatctact gcctgtcacc gccggtgaac 660 
ctggagcccc ccgagcctcc acccagcgtg 720 
tccacgggca aggacgtgag gctcagcgcc 780 
aggcagctgc acgcccagga gggcatcgag 840 
aagctgctca cagaccgcac acggctccag 900 
caggtcatca tcaaccagcc cccaccaccc 960 
agaggccccg cctggagcac taggccccca 1020 
tcttcagggg ccctcccctc ggtgtggctg 1080 
cagggcacta cgcgccacca gttcccggta 1140 
tgcaaccttg tcagagaaaa ggcgaacagg 1200 
ttcgagccac aagggccaac caggaggccc 1260 
cagctgtgct cactctggtt ttctgctcag 1320 
gcccccatcc cctggctctc ccctgggcac 1380 
cccgtgagcc ccagggcttg ggaagcctga 1440 
acaattggca gagatgaggc gggtggtgga 1500 
gggccatgtc ctggctgtaa cccaggcagt 1560 
cagcaggtcc ctgtggacag acatatctgc 1620 
atatttatca ataaagcctt ttgctccttc aaaaaaaaaa a

<210> 46 
<211> 1910 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature
<223> Incyte ID No: 2323917CB1 

<400> 46 

gtactccgcg cgcagccctg gcggtccagc gcgaatctcc agaatctggt ctccagaaac 60 
attattctaa tctgtgtatg gaaaaatccc agaaaattaa tccttttata ttgcatatac 120 
tccaagaagt ggatgaagaa attaaaaagg ggctagcagc aggaatcaca ttaaacattg 180 
ctggtaacaa tcgcttagtg ccagtagaaa gagttacagg tgaagatttt tggattcttt 240 
ccaaaatttt aaagaattgt ctgtatatta atggtttgga tgttggatat aaccttctat 300 
gtgatgttgg tgcatactat gctgcgaaac tgcttcagaa acaacttaat ctcatttact 360 
taaacctcat gtttaatgat attgggcccg aaggtggaga attgattgct aaagtgctac 420 
ataagaatcg gactctgaaa tacctaagaa tgactggaaa caaaattgaa aataaaggtg 480 
gaatgttttt tgctgcaatg ctgcaaatta attcatcatt agagaaatta gatctgggtg 540 
actgtgatct gggaatgcaa agtgtgatag catttgctac agtactaact caaaaccaag 600 
caattaaggc aataaaccta aaccgaccta tactgtacgg tgaacaggaa gagtctacag 660 
tccatgtagg cctcatgttg aaagaaaatc actgtcttgt tgcactacac atgtgtaagc 720 
atgatataaa aaacagtggt atacaacagt tatgtgatgc actgtatctg aacagtagcc 780 
tgcgctacct tgatgtcagc tgcaacaaaa taactcatga tggaatggtg tatttggctg 840 
atgtactgaa aagcaacact accctggaag taatagatct ttcctttaac agaatagaaa 900 
atgcaggcgc aaactatctc agtgaaactc ttacttcaca caacaggagt cttaaagcgt 960 
tgtcagtagt cagcaacaac atagagggag aaggacttgt tgcactttca caatcaatga 1020 
aaacaaatct cactttctct catatctaca tttggggaaa caaatttgat gaggctacgt 1080 
gtatagcata ttcagactta attcaaatgg gttgtctaaa accagacaat acagatgtgg 1140 
agccatttgt ggtagatgga cgtgtatatc ttgcagaagt ctccaatggc cttaaaaagc 1200 
attattattg gacatcaact tatggagaat cttatgacca ctcatctaat gcaggttttg 1260 
ctcttgttcc agtaggtcaa cagccatgaa aaagtagctg taaaataatt ttcatacact 1320 
tgtcttattg tttcacagag attaatattc tatagcctat tttcttttat aaattaaaac 1380 
aagttacttt tttcaaatgt gtaaaagata atgactgtat taaactttgt ataactgtca 1440 
aaaaaaaaaa aaagggggcg gccccgatta gtgagctcgt ccccgcgggt atttactccc 1500 
ggggcccggt cccttggggg gggttaacca gtctttccgc caagggtggt gtcctgtata 1560 
tacaggcgtc ttggaaagag aagtcgtgct gtccctgggc tcttcttctg gtcggagggc 1620 
attttactcg tctgtccccg ctggtgtaat atagtaccgc cacgcgtgtg gtccgtggta 1680 
tgttggaaat aagtgcttgg tggagtctcg ctcggtgttg gacgtaggta gatcctcgca 1740 
gacgttctca tatcggcttg gcgttggtcg tggccccgct tcccgtggtt gtgttgtcga 1800 
ggtagtggtg ttgcacctgt atttatattg gtccctccgc cgcgcggggt ggaggcgtgg 1860 
ttggtttagt agctcgtggc ggttgttgtg ttcctcaacc cacacacaat 1910 



1661 



<210> 47 
<211> 2162 
<212> DNA
<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 2754960CB1 



<400> 47 

gacagcgcaa ccgcactcca gcctggacga 
aaaaaaaaaa aagcgagggc tggaagtctg 
gggaagctcg ggccggcagg gtttccccgc 
cgctgtaagt ttcgctttcc attcagtgga 
cggggccaga ccaaggcggg cccggagcgg 
gtcccgacgt ggaactcagc agcggaggct 
tcgtgcctgg ctcacataag cgcttcctgg 
caggcagctg cggcctgggg gttttggagt 
tgaggcaaat ctgtcagtcc atcctggctg 
aaaagaagga agaagacagc aacatgaaga 
gggactaccc tcatggcctg gttggtttac 
ccttgattca ggtgttcgta atgaatgtgg 



cagaatgaga ctccgtctca aaaaaaaaaa 60 
gagcaggtgc gcggctgcaa cggcagccgc 120 
acgctggcgc ccagctcccg gcgcggaggc 180 
aaacgaaagc tgggcggggt gccacgagcg 240
aacttcggtc ccagctcggt ccccggctca 300
ggacgcttgc atggcgcttg agagattcca 360
aagtgaagtc gtgctgtcct gaacgcgggc 420
gatcacgaat gagcaaggcg tttgggctcc 480 
agtcctcgca gtccccggca gatcttgaag 540 
gagagcagcc cagagagcgt cccagggcct 600 
acaacattgg acagacctgc tgccttaact 660 
acttcaccag gatattgaag aggatcacgg 720 
tgcccagggg
agaagatgca 
agaagtgcaa 
ggaacctgat 
atacgatccg 
acagcagcat 
cactggagga 
gcttctgtga 
tgccccagac 
agatctgcca 
agcgagagtc 
ttgcgcacgt 
atggaaaatg 
agcgtaccta
tgaagatgga 
ccatttccgt 
gttttcaaac 
catgaggccc 
tgtggctgct 
cctgtgggaa 
gccaaaggtc 
tactggctga 
tatgaatcaa 
cttctccata 
aa 



agctgacgag 
ggacagccgg 
cgtgcccttg 
taaggaccag 
ggtgaaggac 
gctcaccctc 
cgccctgcac 
gaactgcggg 
cctgacaatc 
ctccctgtac 
ttgtgatgct 
gggaatggca 
gttctgcttc 
cggaaatcct 
gtgctaatgg 
tcctggatct 
tatataactg 
ctcaggtcct 
cggtcctggg 
cttcaggggt 
agtggcaggg 
atatcagtgc 
gtgttttgta 
agatagtgtg 



cagaggagaa 
cagaaagcag 
tttgtccaac 
atcactgatg 
tccttgattt 
ccactttctc 
tgcttcttcc 
aagaagaccc 
cacctcatgc 
ttcccccaga 
gaggagcagt 
gactccggtc 
aatgactcca 
aactaccact
aaatgcccaa 
acggagtctt 
agccttattt 
gatcagtcag 
tgctcgctgc 
tcccagtggg 
ggtatttcag 
tgtttgtaat 
actgctattc 
ataaacacag 



gcgtcccttt 
tgcggcccct 
atgatgctgc 
tgcacttggt 
gcgttgactg 
tttttgatgt 
agcccaggga 
gtgggaaaca 
gattctccat 
gcttggattt 
ctggagggca 
attactgtgt 
atatttgctt 
ggcaggaaac 
aaccttcaga 
ctaagagatt 
ataattaggg 
aatggatgct 
tgtgcaagac 
gagagcagtg 
tattatacaa 
ttttcacttt 
atttattcag 
tcatgaataa 



ccagatgctt 
ggagctggcc 
ccaactgtac 
ggagagactg 
tgccatggag 
ggactcaaag 
gttatcaagc 
ggtcttgaag 
caggaattca 
cagccagatc 
gtatgagctt 
ctacatccgg 
ggtgtcctgg 
tgcatatctt 
gattgacacg 
ttgcaatgag 
atattatcaa 
ttcaccagca 
attagccctt 
gcagtgggag 
ctgctgtgac 
gagaaccaac 
caaatattta 
agttattttc 



ctgctgctgg 
tactgcctgc 
ctcaaactct 
caggccctgt 
agtagcagaa 
cccctgaaga 
aaaagcaagt 
ctgacccatt 
cagacgagaa 
cttccaatga 
tttgctgtga 
aatgctgtgg 
gaagacatcc 
ctggtttaca 
ctgtcatttt 
gagaagcatt 
aatatgtaac 
gacccggcca 
tagttatgag 
gcatctgggg 
cagacttgta 
attaattcca 
ttgatcatct 
cacaaaaaaa 



780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2162 



<210> 48 
<211> 578 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 3092341CB1 



<400> 48 

gaaaggccca cctgtgtcct ggttgagggt ctccagggtt ctttggggcc cgaggaattt 60 

cagacctgaa caatcaggtg gacaacttct tgtgactttg ggtattctga aattactcat 120 

cttgactcta ctgtgcctgg atgtagctgt gtgttttgtg tgtatcctgt tcatggccaa 180 

tggtggcaga gtctacatag aactatgctt cgtggtgttc tggggaaaac ctttcgactt 240 

gttggctata ctattcaata tggctgtata gctcattgtg cttttgaata cgttggtggt 300 

gttgtcatgg tgccaatggg tcatgtttgg ttagaaggtg acaatctaca gaattctaca 360 

gattccaggt gctatggacc tattccatat ggactaataa gaggacgaat cttctttaag 420 

atttggcttc tgagtgattt tggattttta cgtgccagcc ctaatggcca cagattttct 480 

gatgattagt aagcatttat tcttttgact tgattattgt ctccttttca tgtgaattta 540 

ttactcccgt tgaaaccgtg tacttaccaa taaactat 578 



<210> 49 
<211> 1300 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 3658034CB1 



<400> 49 

agggaatccc aattatgagg cggctaagca 
ctactcagga taccactgtg gctttaaagg 
gaaaggacaa atatccaagt gaccgtgacg 
ctgattgaca cacacaaccg cttactcctt 
acggcagtta atatttccgc aaatggtttt 
tataatgtga aggcttctgg gtcttctaga 
tttgatttag atgttgctgt aaaagaaaat 
gtgtgtacaa gcttttcggg cccgggtagg 
ttaagtggct ttatggtgcc ttcagaagca 
gaatatgatc atggaaaact caacctctat 



ggcaaagaaa tagcttgggt ggttttgcat 60 
ctgtctgaat ttgcagccct aatgaataca 120 
gggcctagct caccaagtcc tgtaaagttt 180 
cagacagcag agcttgctgt ggtacagcca 240 
ggatttgcta tttgtcagct caatgttgta 300 
agacgaagat ctatccaaaa tcaagaagcc 360
aaagatgatc tcaatcatgt ggatttgaat 420 
agtggcatgg ctcttatgga agttaaccta 480 
atttctctga gcgagacagt gaagaaagtg 540 
ttagattctg taaatgaaac ccagttttgt 600 
gttaatattc ctgctgtgag aaactttaaa gtttcaaata cccaagatgc ttcagtgtcc 660

atagtggatt actatgagcc aaggagacag gcggtgagaa gttacaactc tgaagtgaag 720 

ctgtcctcct gtgacctttg cagtgatgtc cagggctgcc gtccttgtga ggatggagct 780 

tcaggctccc atcatcactc ttcagtcatt tttatcttct gtttcaagct tctgtacttt 840 

atggaacttt ggctgtgatt tatttttaaa ggactctgtg taacactaac atttccagta 900 

gtcacatgtg attgttttgt tttcgtagaa gaatactgct tctattttga aaaaagagtt 960 

tttcttcttt ctatggggtt gcagggatgg tgtacaacag gtcctagcat gtatagctgc 1020 

atagatttct tcacctgatc tttgtgtgga agatcagaat gaatgcagtt gtgtgtctat 1080 

attttcccct ctcaaaatct tttagaattt ttttggaggt gtttgttttc tccagaataa 1140 

aggtattact ttagaatagg tattctcctc attttgtgaa agaaatgaac ctagattctt 1200 

aagcattatt acacatccat gtttgcttaa agatggattt ccctgggaat gggagaaaac 1260 

agccagcagg aggagcttca tctgttccct tcccacctcc 1300 



<210> 50 
<211> 2241 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 3883861CB1 



<400> 50 

gctccccgag ccgggctgca ccggaggcgg cgagatggtc gcgcgcgtcg gcctcctgct 60 
gcgcgccctg cagctgctac tgtggggcca cctggacgcc cagcccgcgg agcgcggagg 120 
ccaggagctg cgcaaggagg cggaggcatt cctagagaag tacggatacc tcaatgaaca 180 
ggtccccaaa gctcccacct ccactcgatt cagcgatgcc atcagagcgt ttcagtgggt 240 
gtcccagcta cctgtcagcg gcgtgttgga ccgcgccacc ctgcgccaga tgactcgtcc 300
ccgctgcggg gttacagata ccaacagtta tgcggcctgg gctgagagga tcagtgactt 360
gtttgctaga caccggacca aaatgaggcg taagaaacgc tttgcaaagc aaggtaacaa 420 
atggtacaag cagcacctct cctaccgcct ggtgaactgg cctgagcatc tgccggagcc 480 
ggcagttcgg ggcgccgtgc gcgccgcctt ccagttgtgg agcaacgtct cagcgctgga 540 
gttctgggag gccccagcca caggccccgc tgacatccgg ctcaccttct tccaagggga 600 
ccacaacgat gggctgggca atgcctttga tggcccaggg ggcgccctgg cgcacgcctt 660 
cctgccccgc cgcggcgaag cgcacttcga ccaagatgag cgctggtccc tgagccgccg 720 
ccgcgggcgc aacctgttcg tggtgctggc gcacgagatc ggtcacacgc ttggcctcac 780 
ccactcgccc gcgccgcgcg cgctcatggc gccctactac aagaggctgg gccgcgacgc 840 
gctgctcagc tgggacgacg tgctggccgt gcagagcctg tatgggaagc ccctaggggg 900
ctcagtggcc gtccagctcc caggaaagct gttcactgac tttgagacct gggactccta 960
cagcccccaa ggaaggcgcc ctgaaacgca gggccctaaa tactgccact cttccttcga 1020 
tgccatcact gtagacaggc aacagcaact gtacattttt aaagggagcc atttctggga 1080 
ggtggcagct gatggcaacg tctcagagcc ccgtccactg caggaaagat gggtcgggct 1140 
gccccccaac attgaggctg cggcagtgtc attgaatgat ggagatttct acttcttcaa 1200 
agggggtcga tgctggaggt tccggggccc caagccagtg tggggtctcc cacagctgtg 1260
ccgggcaggg ggcctgcccc gccatcctga cgccgccctc ttcttccctc ctctgcgccg 1320 
cctcatcctc ttcaagggtg cccgctacta cgtgctggcc cgagggggac tgcaagtgga 1380 
gccctactac ccccgaagtc tgcaggactg gggaggcatc cctgaggagg tcagcggcgc 1440 
cctgccgagg cccgatggct ccatcatctt cttccgagat gaccgctact ggcgcctcga 1500 
ccaggccaaa ctgcaggcaa ccacctcggg ccgctgggcc accgagctgc cctggatggg 1560 
ctgctggcat gccaactcgg ggagcgccct gttctgaagg cacctcctca cctcagaaac 1620 
tggtggtgct ctcagggcaa aatcatgttc cccacccccg gggcagaacc cctcttagaa 1680 
gcctctgagt ccctctgcag aagaccgggc agcaaagcct ccatctggaa gtctgtctgc 1740 
ctttgttcct tgaagaatgc agcattgtct ttgtctgtcc ccaccacatg gaggtggggg 1800 
tgggatcaat cttaggaaaa gcaaaaaagg gtcccagatc ccttggccct ttcctccgag 1860 
gacttctatc ctccccaggc ctttgtttct tcggctaaag gtacagttcc tttcaagagg 1920 
taacagcact gggatccaag cagggggatg aaaaactcag cagagaaatt cgagaccatt 1980 
ttgcaagact gtgcccttct cctcaggacc ccctggctca gttcttgaaa aacggtgtca 2040 
tatttagtca gaggccccac ccccaggaag catggatggg gatgaaggca caggcgtctc 2100 
caacctcaga ggccctttgt ggggtcagga cacagagtgg gagggagact gatgcaggcc 2160 
taccagtccc tggctttttg tctggggctg gaataaagag gtgccttcag ctggtgggcc 2220 
gagaggcagg aaaaaaaaaa a 2241 

<210> 51 
<211> 1860 
<212> DNA 

<213> Homo sapiens 
<220>

<221> misc_feature
<223> Incyte ID No: 4993873CB1



<400> 51 

agaattgacc 

ggtagaggtg 

ggtccagctt 

cttcttgccc 

gagccagccc 

tataaagagc 

accaccctgg 

ggcctgggat 

ctcctccaca 

ttcctagaca 

ggagcttttg 

gactatttga 

gacacgttca 

agtcgctacc 

gtccccatga 

actgtcctcc 

gggaaaatga 

ttgctcctgc 

aacctggaag 

ttctcaggag 

gtggacatga 

ccatctctga 

ctttgggagg 

gggtaaccat 

cagaaccagc 

taataaagtt 

gcggcgtaaa 

ggggggccgg 

gggggggcgg 
ccataggacc 
gaccgacttt 



tgacaaatat 
atgcagtgct 
ggctttggct
atggagataa 
ccgcctacca 
tggcagcaga 
ccctgctctc 
tcaacctcac 
cccttgccct 
agcgactaaa 
ctttttctgc 
gaaggcaaac 
tggttcttgc 
agacccagaa 
tgcaccaaaa 
agatagaata 
agcaggtgga 
ccagtctgtt 
acatacttcc 
tcactgggca 
gtgagaaggg 
acaccatgtc 
tcaccaccca 
ggtgggaggc 
ctgcatcctg 
gaccttgggt 
ggatgctgaa 
cagtacagaa 
agagagtttt 
cccggagctg 
tagtggcaca 



tgaccaggtt 
gaagacctgg 
actgggaaca 
aagtctgcag 
cagaatcaca 
cgcccccgga 
tcttggggcc 
agaaacccct 
gcccagcccc 
gcctcggcag 
caacttcaca 
atacgggcaa 
caattacatc 
gcaggcaagt 
ggaaatgcac 
cagaggaaat 
ggctgctctg 
ggatttgcac 
ccaaattggt 
gctcaacaaa 
gaccgaggcc 
agacccacat 
gagcttactc 
caggagttat 
gggctgctat 
tctgtgaaaa 
agcaaatacc 
ggtccctcag 
tacgcatctt 
gagggttgtt 
aactgtgggg 



gctgtgatct 
gcccctgctc 
gggatcctgg 
gggcctcaac 
cccaccatta 
aacatcttct 
caagctaaca 
gaagccgaca 
aaactcgaac 
cactatttgg 
gattctgtta 
gtcgtggact 
ttcttcaaag 
ttctttgtgg 
agattcctct 
gccttggcgc 
cagccacaga 
ttgccaaggt 
ctcaccaaca 
accatctcca 
ggggctgctt 
gcccacttca 
ttcctgggaa 
cttatctcat 
gtggttcagt 
aaaaaaaaag 
ccggtttgcc 
agtgtgtttg 
tgcacgtgtt 
ttgcgccggc 
cttctcatat 



tcaaacacca 
agtgcctttg 
cctctgtcca 
cccccaggca 
ccaattttgc 
tctcgccagt 
cctcagctct 
tccaccaggg 
taaaagtagg 
acagcatcaa 
caactgggag 
gcctcccgga 
ccaagtggaa 
atgagaggac 
atgaccagga 
tgctggtcct 
ccctgagaaa 
tttcaatttc 
tactcaactt 
aggtgtcaca 
caggcctcct 
acaggccttt 
aagttgtcaa 
cctggaccaa 
taatcagtgt 
aacaaccaac 
cgtccaaatt 
atggccgtcg 
agcacccggg 
actcttgggg 
gtgtgcgcgc 



ttttcctgtg 
ctctagaatg 
ctgtcagccc 
tcagctctca 
tttgcgtttg 
gagcatctcc 
gatcctggag 
cttccggagc 
aaactccctg 
ggagctttat 
gcagattaat 
gttcagccag 
gcaccctttc 
ttctctccag 
tttggcttgc 
ccctgacccg 
atggggccaa 
tggaacatat 
agaagctgac 
caaggcgatg 
ctcccagccc 
cctcttgctc 
cccagttgca 
acagataggc 
gccaagattc 
gcataaagtt 
taacataagg 
actgattggc 
gggggggggc 
gggtgtttat 
gcttctgttt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 



<210> 52 
<211> 550 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 5208004CB1 



<400> 52 

ggcgggtggc 

ggagcaggac 

attctgaatc 

ccacagggac 

gggaagcctg 

tcccttctca 

ctgtcccaaa 

tcaggcacca 

ccccgctggg 

ctgaatgtgt 



tggaggacac 
tgtttctgga 
cgagaaagtg 
cgaccagaga 
agaggcggcc 
accctttctc 
actcctggct 
tcccccccac 
ttcctgatgc 



ggtgtaggcc 
aacactgggc 
aaggaaggat 
tgcggtggag 
agcaccaccc 
tcctcctcgc 
cctttgttct 
gagggatcca 
cttgtgcaca 



ttgccgctgt 
tgctggtgac 
ggtggggaag 
acagaggagc 
tccgctcact 
ctcttccctt 
gctgccgtgg 
cacaacaggt 
gggaagtgct 



ctgggttcct 
ccgtcaaact 
tgaggaggca 
ttccttctca 
ctcccctcag 
tcttgctgtc 
cccccaccca 
tcatgctggg 
gcagtcattt 



cgccactgca 
tctcccacaa 
ggagcagagg 
ggctgtttct 
cctcttcgct 
ttgttagtcc 
ggaggaggtc 
gctgggggag 
ttggactctc 



<210> 53 
<211> 2597 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 5267783CB1 



60 

120 

180 

240 

300 

360 

420 

480 

540 

550 
<400> 53

gcctggttgg gccgcgtcct tctgctgccc 
aggtgtcctg gagtctggcc caggacctgg 
tcccgggaca aggaccgaag tgcgacggtc 
aaaggaagcc atccttcatc tacaccccag 
tcaccatacc tcctacaaca tgcctacaat 
gccttcgaca aggccaggaa ggaaaacaag 
tgccactggt gccacatgat ggaagaggag 
ctcagtgagg actttgtgag tgtgaaggta 
gtgtacatga cgttcgtgca ggccaccagc 
ctgactccca acctccagcc ctttgtcggg 
acccgagtcg gcttccgcac agtgttgctg 
aacaccctgc tagaaaatag ccagcgtgtc 
agcgtgggtg accgccagct gccgccctct 
cagctggatg agggctatga tgaggaatac 
acgccggtga tcctgagctt cctgttctcc 
ggctctcggg cccagcagat ggccttgcat 
cgggaccatg tggggcaggg ctttcaccgc 
cactttgaga agatgctcta tgaccaggca 
cagctctctg gtgatgaatt ctactctgac 
cggagcctga gccaccggtc cggaggcttc 
gagcggggcc agcggcccaa agagggcgcc 
cagctcctcc cggagcctgt gttgggtgcc 
atgaagcact acggcctcac agaggctggt 
gagctgcagg gccagaatgt gctgaccgtc 
tttggcttgg atgtggaggc cgtgcggacc 
caggcccgga agcatcggcc caagccgcac 
ggcttgatgg tgtcaggcta tgctgtgact 
aactatgcca ccaatggtgc caagttcctg 
cgcctgatgc ggacctgcta caccggccct 
tgctggggct tcctggagga ctacgccttc 
gcctcacagg agagtgcgtg gctcgagtgg 
ctcttttggg actcccaggg tggcggctac 
ctgcccctgc gtctgaagga cgaccaggat 
gcccacaacc tgctccggct gcatggcttc 
gtgtgcctat tgaccgcctt ttccgagcgc 
atggtccgcg ccctctcagc ccagcagcag 
cgtcaggcca aggacaccaa ggccctggtg 
aaggtgctga ttctggctga tggggacccc 
ctgagtaccc tccgacggtt ggaagaccag 
tgctcagtgc ccatcactga tccctgcgaa 
caaccccctt ggggtggggc agaaggtgaa 
gcagggccct atagaacctg tggccatccc 
atactcactg ccccccttgg gcacccactc 
gtaaaaaaaa aaaaaaa 



cgcgccggtg caggcctcgc cgcgagccgc 60 
ccccacagga gtcccagcag gggtagctcc 120 
agtagttcag tgcccatgcc tgctggaggg 180 
agggtcccca accgcctgat ccacgagaag 240 
cctgtggact ggtacccctg gggagaggaa 300
ccgattttcc tctcagtcgg gtactccacc 360 
tccttccaga atgaggagat tggccgcctg 420 
gaccgtgagg agcggcctga cgtggacaag 480
agcggcgggg gctggcccat gaatgtgtgg 540 
ggcacctatt tccctcctga ggatggcttg 600 
agaatacgag aacagtggaa acagaacaag 660 
accactgccc tgctggcccg atcagagatc 720 
gccgccaccg tgaacaatcg ctgcttccag 780 
ggtggcttcg ctgaggcccc caagtttccc 840 
tactggctca gccatcgact gactcaggat 900 
accctgaaaa tgatggctaa cgggggcatc 960 
tactccacag accgccagtg gcacgtccct 1020 
cagctcgctg tggcctattc gcaggccttc 1080 
gtggccaaag gcatcctgca gtacgtggct 1140 
tatagcgcag aagatgcaga ctcgccccca 1200 
tactatgtgt ggacggtcaa agaggttcag 1260 
accgagccgc tgacctcagg ccagctcctc 1320 
aacatcagcc ccagtcagga ccccaagggg 1380 
cggtactcgc tggagctgac tgctgcccgc 1440 
ttgctcaatt cagggctgga gaagctcttc 1500 
ctggacagca agatgctggc tgcctggaat 1560 
ggggctgtcc tgggccaaga caggctgatc 1620 
aagcggcaca tgtttgatgt ggccagtggc 1680 
ggggggactg tggagcacag caacccaccc 1740 
gtggtgcggg gcctgctgga cctgtatgag 1800 
gctctgcggc tgcaggacac acaggacagg 1860 
ttctgcagtg aggctgagct gggggctggc 1920 
ggagcagagc ccagcgccaa ttccgtgtca 1980 
acgggccaca aggactggat ggacaagtgt 2040 
atgcgtcgtg tcccggtggc gttgcccgag 2100 
accctcaagc agatcgtgat ctgtggagac 2160 
cagtgcgtcc actctgtcta cattcctaac 2220 
tcgagcttcc tgtcccgcca gctgcctttc 2280 
gccactgcat atgtgtgtga gaatcaagcc 2340 
ttacgaaaac tactacatcc atgactgccc 2400 
gcatcccaac tgactagaga ctcaggccct 2460 
tgagcaccct gccaccaggt gacctcggcc 2520 
accctagaat aaacttaaca gtgtcccgtg 2580 

2597 



<210> 54 
<211> 2385 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 5583922CB1 



<400> 54 

gaccgacccg ccacccgccg tagcccgcgc 
ggctcctgct cgccctggcc gccttcgcgc 
ggaactcggt gctgggcctc gcgcagcccg 
ccctgcatag cagcccggca cagccgccgg 
atgtccggat tcgagtcatc aagaagaaaa 
ctctaactcg ccccacccca ctggtgactg 
ggaccctcga ccccgctgag aaacaagaaa 
tgcgagtttc agatagccgg cttgaggcat 
accgaggacg gctcaacatt cagtcaggcc 
ggtgtgctga ggagcaggac gccgatccat 
gcttctcggg tgttatcaca cagggcagta 



gcccctggca ctcaatcccc gccatgtggg 60 
cggccgtcgg cccggctctg ggggcgccca 120 
ggaccaccaa ggtcccaggc tcgaccccgg 180 
cggagacagc taacgggacc tcagaacagc 240 
aggtcattat gaagaagcgg aagaagctaa 300 
ccgggcccct tgtgaccccc actccagcag 360 
caggctgtcc tcctttgggt ctggagtccc 420 
ccagcagcca gtcctttggt cttggaccac 480 
tggaggacgg cgatctatat gatggagcct 540 
ggtttcaggt ggacgctggg caccccaccc 600 
actctgtctg gaggtatgac tgggtcacat 660 
catacaaggt ccagttcagc aatgacagtc ggacctggtg gggaagtagg aaccacagca 720
gtgggatgga cgcagtattt cctgccaatt cagacccaga aactccagtg ctgaacctcc 780 
tgccggagcc ccaggtggcc cgcttcattc gcctgctgcc ccagacctgg ctccagggag 840 
gcgcgccttg cctccgggca gagatcctgg cctgcccagt ctcagacccc aatgacctat 900 
tccttgaggc ccctgcgtcg ggatcctctg accctctaga ctttcagcat cacaattaca 960 
aggccatgag gaagctgatg aagcaggtac aagagcaatg ccccaacarc acccgcatct 1020 
acagcattgg gaagagctac cagggcctga agctgtatgt gatggaaatg tcggacaagc 1080 
ctggggagca tgagctgggg gagcctgagg tgcgctacgt ggctggcatg catgggaacg 1140 
aggccctggg gcgggagttg cttctgctcc tgatgcagtt cctgtgccat gagttcctgc 1200 
gagggaaccc acgggtgacc cggctgctct ctgagatgcg cattcacctg ctgccctcca 1260 
tgaaccctga iggctatgag atcgcctacc accggggttc agagctggtg ggctgggccg 1320 
agggccgctg gaacaaccag agcatcgatc ttaaccataa ttttgctgac ctcaacacac 1380 
cactgtggga agcacaggac gatgggaagg tgccccacat cgtccccaac catcacctgc 1440 
cattgcccac ttactacacc ctgcccaatg ccaccgtggc tcctgaaacg cgggcagtaa 1500 
tcaagtggat gaagcggatc ccctttgtgc taagtgccaa cctccacggg ggtgagctcg 1560 
tggtgtccta cccattcgac i.'tgactcgca ccccgtgggc tgcccgcgag ctcacgccca 1620 
caccagatga tgctgtgttt cgctggctca gcactgtcta tgctggcagt aatctggcca 1680 
tgcaggacac cagccgccga ccctgccaca gccaggactt ctccgtgcac ggcaacatca 1740 
tcaacggggc tgactggcac acggtccccg ggagcatgaa tgacttcagc tacctacaca 1800 
ccaactgctt tgaggtcact gtggagctgt cctgtgacaa gttccctcac gagaatgaat 1860 
tgccccagga gtgggagaac aacaaagacg ccctcctcac ctacctggag caggtgcgca 1920 
tgggcattgc aggagtggtg agggacaagg acacggagct tgggattgct gacgctgtca 1980 
ttgccgtgga tgggattaac catgacgtga ccacggcgtg gggcggggat tattggcgtc 2040 
tgctgacccc aggggactac atggtgactg ccagtgccga gggctaccat tcagtgacac 2100 
ggaactgtcg ggtcaccttt gaagagggcc ccttcccctg caatttcgtg ctcaccaaga 2160 
ctcccaaaca gaggctgcgc gagctgctgg cagctggggc caaggtgccc ccggaccttc 2220 
gcaggcgcct ggagcggcta aggggacaga aggattgata cctgcggttt aagagcccta 2280 
gggcaggctg gacctgtcaa gacgggaagg ggaagagtag agagggaggg acaaagtgag 2340 
gaaaaggtgc tcattaaagc taccgggcac cttaaaaaaa aaaaa 2385
<110> INCYTE GENOMICS, INC.
YUE, Henry
TANG, Y. Tom
BANDMAN, Olga
LAL, Preeti
BAUGHN, Mariah R.
AZIMZAI, Yalda
LU, Dyung Aina M.
YANG, Junming

<120> PROTEASES AND PROTEASE INHIBITORS

<130> PF-0727 PCT

<140> To Be Assigned
<141> Herewith

<150> 60/147,986; 60/160,807
<151> 1999-08-09; 1999-10-21

<160> 54

<170> PERL Program

<210> 1
<211> 44d 
<212> PRT 

<213> Homo sapiens



<220>

<221> misc_feature

<223> Incyte ID No: 06071 BO[>L



<400> 1










Met 


Lys 


Val 


Val 


Pro


Ser Leu


1






5 




Val


Trp


Leu


Val 


Pro 


Gly LW 










20 




Thr 




Ala 




can 












35




Arg 


Glu


Glu


Glu


Glu


Asp Glu 








50




Gly


ola 


Glu 


Glu 


Lys 


Ala Trp 








65 




Ala 


Lys 


Glu 


Thr 


Ser 


Asn Phe 








80 




Ser 


Met 


Arg 


His


Asp 


Gly Asn 










95 




Ser 


Leu 


Ala 


Met 


Thr 


Gly Leu 










110




Glu


Thr 


Gln


Ile


Lys 


Arg Gly 










125




Thr 


Lys 


Pro


Gly 


Leu 


Leu Trp








140




Thr 


Leu 


Ser 


Arg 


Asn


Leu Glu










155




Ala 


Phe 


Ile


His 


Lys 


Asp Phe 










170




Leu 


Ser 


Lys 


Arg 


Tyr 


Phe Asp 










185 




Arg 


Asn


Ala 


Ser 


Gln


Ala Lys 








200 




Lys


Glu 


Thr 


Arg 


Gly 


Lys Ile








215 




Pro 


Glu 


Thr 


Lys 


Leu 


Ile Leu



Leu 


Leu 


Ser


Val 


Leu 


Leu 


Ala 


Gln






10










15 


Ala


Pro 


Ser 


Pro 


Glu


Ser


Pro 


Glu 






25 










30 


Thr 


Ser


Arg 


Val 


Val 


□ in 


Ala


Pro 






40










45 


Gln


Glu 


Ala 


Ser


Glu


Glu 


Lys 


Ala 






55 










60


Leu 


Met 


Ala 


Ser


Arg 


Gln


Gln


Leu 






70










75 


Gly 


Phe 


Ser 


Leu 


Leu 


Arg 


Lys 


Ile




85










90 


Met


Val 


Phe 


Ser 


Pro 


Phe 


Gly 


Met 






100










105


Met 


Leu 


Gly Ala 


Thr Gly 


Pro


Thr






115 










120 


Leu 


His 


Leu 


Gln


Ala 


Leu 


Lys 


Pro 






130 










135


Ser 


Leu 


Phe 


Lys 


Gly 


Leu


Arg


Glu






145 








150 


Leu Gly 


Leu 


Ser 


Gln Gly


Ser 


Phe 






160










165


Asp Val


Lys 


Glu 


Thr 


Phe 


Phe 


Asn 






175










180




Glu 


Cys 


Val 


Pro 


Met 


Asn 


Phe 






190










195 


Arg 


Leu 


Met 


Asn 


His 


Tyr


Ile


Asn 




205










210 


Pro 


Lys 


Leu 


Phe


Asp 


Glu 


Ile


Rbji 




220










225 


Val


Asp


Tyr


Ile


Leu 


Phe 


Lys 


Gly 
230



Lys 


Trp 


T r^i i 

Lieu 


T)ir 


DVi\ 




Asp 




Val 
















Phe 


His 


Leu 




Lys




Lys


U/n r 


Ele 




















Gly


Ala 


Gly 




1*119 


J, 1 Q 


Ser


tOi r 


Phe 








Z tz> 










His 


Val


Leu






Pro 


Tyr 


G Irj 


Gly 




















Val 


Leu 


Met 


Cd 1 u 


Lys


U^i r- 
L 


n lu 
*ui J.y 


Asp


His 




















Leu


Thr 


Thr 


Asp 


LffLi 


\?=. 1 
v a x. 




LilL 


•I'm 
.irp 










320 










Arg 


ABD 


Met 


Glu


Val


Ft}© 


nVi d 


0 


Lys








335










Tyr


Glu 


Met 


His 


Glu


Leu


Leu


Ar 


L?l LJ 


















Phe 


Ser 


Pro 


TiV r*. 


h "I -i 

Ala 


Asp 


L 811 




L?-l<-? 




















Asn 


Leu 


Gln






Ara 


V-tll 


L&li 












!£B 0 










Asp


r ,,- ln 


Ar q 


^ 1 vr 
*J 1. 


i. ji J- 




Ala 


Val


Ala 
















Thr 


Ala 


Tyr 


Ser


Met


Pro 


Pro 




lift 










410 










His


Phe


Met


11^ 


Tyr 






Thr 


Ser










425 










Gly 


i\rtf 


Val 


Val


Asn 


Pro 


Thr 


Leu 


Leu 



440 

<210> 2 
<211> 565 
<212> PRT 

<213> Homo sapiens



<220> 

<221> misc_feature

<223> Incyte ID No: 1U5S1CD1



<400> 2
















Met 


Ser


Gly Arg 




Lys


Arg 


Glu 


Ser 


1 


















Lys


Arg


Glu


Ser


Glu 
20 


Ser 


Arg 


Gly 


Ser 


Glu 


Arg 


Asp 


Arg 


Glu 
35 


Arg 


Glu 


Pro 


Glu 


Ser 


Pro 


Val 


Arg 


Val 


Lys 


Arg


Glu 


The 








50 










Glu 


Ala 


Pro 


Ala 


Ser
65 


Val 


Val 


Pro 


Phe


Arg 


Glu 


Val 


Asp 


Glu 


Asp 


Ser 


Glu 


Pro 






80










Lys 


Asn 


Gly 


Ara 


Val 


Asp 


Ser 


Glu 


Asp








95 










Pro 


Tyr 


Leu 


Asp 


Thr 
110 


Ile


Asn 


Arg 




Glu 


Lys


Leu


Oyr, 


SM' 
125 


Ile


Ser


Leu 


£<^r 


Cys 


Leu 


Val 


t'ys 


Gly 


Lys 


Tyr 


PftC: 


Gln






140










His 


Ala 


Tyr 


Ile


His 
155 


Ser


Val 


Gill 


Phe 


Asn 


Leu 


His 


Thr 


Leu 


Lys 


Phe Tyr


Cys 










170 










Ile


Ile


Asp 


Ser 


Ser 


Leu 


Glu Asp 


Ile








185










Thr 


Phe 


Thr 


Lys 


Gln


Gln


Ile


Ala 


Asn 








200 
235 240





Thr 


1 u 


Viai 


Asp 


'Hit* 














Lys 


VBJ. 


PTO 




Mut. 


Tyr 












3 t n 


ABp 


Ly 






AiQf 
















A3Tj 


A La 


1 III. 


Ma ^* 


Leu 


val 












"4 ri n 
J V ij 


Leu 


H 1 a 
130










135 


Tyr 


Lye 


Leu 


Glu 


Thr 


Tyr 


Leu 


Lye 






145 










150 


ABp 


Asp 


Asp 


Pro 


Val 


Gin 


A La 


Glu 






160 










165 


Lea 


Leu 


Gin 


Asn 


Glu 


Ser 


Thr 


ABTj 






175 










180 


Lys 


Val 


CyB 


Tyr 


Ala 


Arg 


Val 


lieu 






1?0 










195 


Glu 


A16 


Ala. 


cm 


Arg 


Tyr Asn 


Glu 






205 










210 


His 


Glu 


Ster 


C1U 


Arg 


hm 


GLU 


Al* 






220 










22S 


Thr 


n« 


L<*U 


Ale 




hid 


GLy 


Gin 






235 










2dD 
Gln


Arg 


Ser 


Arg 


Met 


Leu 


Ala 


Thr 


Leu 








245 










air) 


Gl.n 


Leu 


Ala 


Ala 

2S0 


Tyr 


Gly 


11© 


Leu 


Arg 


n.a 


11* 


Arg 


GLy Asn 


Gin 


Leu 


Gin 










27 5 










Met 


Pro 


His 


Gin 


Lva 
290 


Ala 


Thr 


Thr 


Ala 


Aep 


Arg 


Ala 


val 


He 


Glu 


His 


Af.ii 


Leu 






3D5 










Tyr 


Asn 


Asn 


lie 


Thx 


Phe 


Glu 


Glu 


T.iCJU 








320 










Pro 


Ala 


Ala 


Lys- 


A La 
325 


Glu 


Lys 


lie 


Ala 


Gly 


Ars 


Met 


A^n 


GLy 


Phe 


He Asp 


Gin 






350 










Ph* 


Glu 


Thr 


Al'13 


GLU 
365 


AlR 


Leu 


Prr> 


Thr 


Ser 


L*?U 


cys 




GLn 
3B0 


Val 


Asu 


A?n 


Leu 


Thr 


AIjA 


?rcj 


Glu 


Txp 
395 




ALa 


njn 


ALa 



GUI 



<210> 11

<211> 172 

<212> PRT

<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte ID No: 3489039CD1

<400> 11

54*t Leu Leu Pro A&n He Leu Leu Thr 

1 5 
Lys Thr Thr Leu Gly Lys Glu Leu Ala

20 

Tyr Ile Asn Val Gly Asp Leu Ala Arg

35

Gly Tyr Asp Glu Glu Tyr Asp Cys Plx> 
50 

VbI Veil Asp Glu Leu Asp ASL) Gin Mut 
65 

Val A*p Tyr His Gly Cys Asp Phe Phe 
80 

114* Ve.1 Pn* Val T.iiu Arg Thr Asp Thr 
95 

Leu Glu Thr Arg Cly Tyr A&n Glu Lys 
110 

dlv cys Glu lie Phe GLn vaL Leu Tyr 
125 

Tyr Lys Glu Glu Ile Val His Gln Leu
140 

Glu Leu Glu abd Asn Val Asp GLn He 
155 

Trp He Lys ABp 13 La Asn Ser 
170

<210> 12 

<211> 517

<212> PRT

<213> Homo sapiens

<220> 

<221> misc_feature

<223> Incyte ID No: 54328790D1






Phe 


Lye 


Asp 


Glu 


Arg 


Cye 


2SD 










25S 


Glu 


LyB 


Met 


Tyr 


Leu 


Asp 


265 








270 


GLU 


Phe 


Ala 


AID 


Met 


LeU 


280 










265 


Asp 


my 


Ser 


Ser 


I Le 


Leu 


255 










3 00 


L«u 


Set 


Ala 


Set 


Lys 


Leu 


310 










315 


Gly 


Ala 


Leu 


Leu 


GLU 


He 


325 










33 0 


Ser 


cm 


He t. 


Ho 


TtlY 


C\l. L) 


M0 










345 


Tie 


Aep 


Gly 


lls> Val 


His 


355 










360 


Trp 


Aep 


Lyi; 


Gin 


He 


Gin 


370 










375 


Leu 


Glu 


Lys 


He 


Ser 


Gin 


365 










390 


Met 


Glu 


Ala 


GLn 


Met 


Ala 


400 










405 



Gly 


Thr 


Pro 


Gly 


Val 


GLy 


10 










L!i 


Ser 


Lya 


ser 


Gly 


Leu 


Lys 


25 








20 


GLu 


Glu 


Gin 


Leu 


Tyr 


Asp 


AO 










45 


lift 


IA*U 


Asp 


GlU 


Asp 


Arg 


£5 










60 


Arg 


CilU 


Cly 


Gly 


Val 


Ufi 


70 










75 


Pro 


Glu 


Arg 


Trp 


Phe 


His 


«5 










90 


Asn 


Val 


Leu 


Tyr 


Glu Arg 


100 










LPS 


LyB 


Leu 


Thr 


ABp 


Asn 


lie 


1L5 










L20 


GLu 


Glu 


Ala 


Thr 


Ala 


ser 


1.™ 










235 


Pro 


Ser 


Asn 


Lye 


Pro 


GlU 


14S 








150 


Leu 


Lys 


Trp 


He 


Glu 


Gin 










165 
<400> 13



Met 


Leu 


Ser 


Ser 


Arg 

S 


Ala 




Al a 


A J. 51 


1 

Als 


lie 


Gin 


Arg 


Phe 


LeU 


Ar>g 


Thr 


1 - r 

Ly 










20 










VaL 


Mat 


Lys 


Abd 


Trp 


Gly 


Val 


11* 


Giy 










3b 










Ala 


Ala 


Gly 


31a 


Tyr 


Va 1 




Trp 


ciy 








SO 










Lys 


Arg 


Arg 


Lys 


Gly 


L*<U 


Val 


Pro 


tiiy 








65 










Thr 


cys 


Phe 


Met 


Ast) 


S^r 


L^U 


LGU 


Gin 








SO 










Ala 


Phe 


rie 


Arg 


Trp 


Lieu 




Gl U 


Phe 










9 5 










*sp 


a In 


Lys 


Glu 


Pro 


Pro 


S er 


Mis 


nl- 

Gm 








110 










Leu 


His 


Leu 


Leu 


Lys 


Ala 


Leu 


Ser 


Cys 










125 










Glu 


Val 


Leu 


Asp 


Ala 


Ser 


cys 


Leu 


Leu 










14 n 










Arg 


np 


Gin 


lie 


Ser 


Ser 


Phe 


Glu 


Glu 






155 










Phe 


His 


VaL 


lie 


Thr 


iter 


Ser 


Leu 


Glu 










IT L) 










i?ro 


Arg 


VaL 


Thr 


His 


Leu 


Phe- 


As-p 


V&l 








185 










Ser 


Glu 


He 


Thr 


Pro 


Lys 


Gin 


Il« 


Thr 










200 










Pro 


Hir> 


Pro 


Thr 


Ear 


Asn 


HIS 


Trp 


Lye 










215 










Oly 


Arg 


Leu 


Thr 


Ser 


Aen 


Met 


Val 


Cys 








230 










Ser 


Pro 


Val 


Arg 


Phe 


Asp 


Thr 


Phe 


Asp 










245 










Pro 


Ala 


Ala 


Thr 


'lip 


Gly 


His 


ezo 


Leu 










~^60 










His 


Mis 


Phe 


lie 


Ser 


£Ser 


Glu 




V/tl 




















Asn 


Cys. 


Thr 


Lys 


riw 


Oil) 


Al a 


Lys 


Gly 








290 










Val 


r.lti 


Ki£J 


Gin 


Arg 


Thr 


Tb* 1 


Phe 


Val 










305 










Lys 


Lcru 


Pro 


Gin 


eye 


Leu 


Cys 


lie 


His 








320 










Ser 


Ser 


HiB 


Gly 


Thr 


Pro 


Leu 


Lys 


Arg 










335 










Asn 


Glu 


Phe 


Leu 


Met 


Met 


Asp 


lie 


Tyr 










350 










Hia 


Lys 


Pro 


Ser 


Gin 


His 


Asn 


Pro 


Lys 








3Sb 










Pro 


Thr 


Leu 


Glu 


Leu 


Gin 


Asp 


Gly 


Pro 










3 BO 










Leu 


Asn 


Gin 


Pre 


Gly 


Ala 


Pro 


Ly £■ 


Thr 










39S 










A La 


Cys 


Ser 


Pro 


Ser 


Leu. 


Leu 


Pro 


inr 








410 










Phe 


Pro 


Leu 


Pro 


Va 1 


Vol 


Pro 


ASP 


TVr 










G.Jb 










n v. i—. 
File 


Arg 


Leu 


M DI- 
LI! cr L. 


A] n 


Val 


V/al 


Val 


His 










4 4 1) 










aly 


His 


Phe 


Val 


Thr 


Tyr 


Arg 


Arg 


Ser 








455 










Pre 


Lea 


Ser 


Thr 


Ser 


Asn 


Gin 


Trp 


Leu 










47D 










VsJ 


Arg 


Lys 


Ala 


Ser 


Leu 


Gin 


Glu 


Val 



435 



net 


i nr 


a la 




Asp 




1U 










-i c. 

ji ^ 


Als 


A l*i 


Val 


A J* (3 


Tyr 


l<ys 


Jb 










30 


ciy 








Ale 


Leu 


■ay 










45 


Pro 




Tnr 


Ij 1 U 


Arg 


Lys 


bb 










60 


LCU 


va l 


ABn 


Leu 


Gly Asn 


/vl 










75 


Gly 


Leu 


Ser 


Al a 


Cys 


Pro 


85 










90 


Thr 


se r 




Tyr 


Ser 


Arg 


100 










IDS 


Tyr 


Leu 


Ser 


Leu 


Thr 


Leu 


115 










120 


Gin 


Glu 


val 


lhr 


Asp Asp 


11D 










13 5 


Asp 


Val 


Leu 


Arg 


Met 


Tyr 


-i « c 










150 


/SI _ 

(sin. 


Asp 


Ala 


His 


Glu 


Leu 


14)0 








16S 


Asp 


Glu 


Artj 


Asp Ar^ 


Gin 


175 










180 


Bi* 






GIU 


CJ.Xl 


01" 


1 $0 










195 


Cys 


Arg 


Thr 


Arg aly 


Ser 


205 










210 


Ser 


Gin 


His 


Pro 


Phe 


His 


220 










225 


LyB 


His 


Cys 


Glu 


His 


Gin 


235 










24D 


Ser 


Leu 


Ser 


Leu 


Ser 


lie 


250 










i5S 


Thr 


Lgu 


Asp 


His 


Cy% 


Le-U 


265 










270 


Arcr 


Asp 


Vftl 


Val 


C.y* 


Asp 


280 










285 


Thr 


Leu 


Asn Gly 


Glu 


Lys 


295 










300 


Lye 


Gin. 


Leu 


LyB 


Leu 


Gly 


310 










315 


Leu 


Gin 


Arg 


Leu 


Ser 


Trp 












13d 


His 


Glu 


His 


Val 


Gin 


Phe 


300 










J45 


Lys 


Tyr 


His 


Leu 


Leu 


Gly 


3 bo 










360 


Leu 


Asn 


Lys 


Asn 


Pru 


Gly 


* n a 
3 / U 










375 


Gly 


Aid 


Pr» 


Thr 


Pro 


Val 


IOC 

3 lib 










390 


r"\ "1 ^ 

i^m 


lie 


Ph.» 


M*>t 


Asn 




40D 










405 


Leu 




Ala 


Pro 


HGt 


Pro 


alb 










420 




Co r 


ssr 


Tlir 


Tyr 


L«*U 


A O A 
U J U 










435 


HIS 


Lly 


Asp 


MeL 


His 


.Ser 


^ A C 
U 41 D 








450 


Fro 


Pro 


ier 


Ala 


Arg 


Asn 


46D 










ae>5 


Trp 


Val 


iier 


Asp Asp 'Ihr 


475 










4H0 


Leu 


Ser 


irer 


Ser 


Ala 


Tyr 


490 










495 
<220>

<221> misc_feature

<223> Incyte ID No: 4S93873CB1

<400> 51

sgaatbgacc tgacaaatat CgaccaggCt gcbgtgatcb tcaaacacca ttttcctgtg 60

BStagaagtg Atgcaatoct aaagacfctgu gccc^ccte agtgccbbfcg ctetatfaata 120 

cgbccagctb ggctbtggct actgggaaca gggatcctgg cctcbgtcca ctgtcagccc 180 

crtctttfct-c atgtfagataa aagtctr/caij gggcctcaac cccccaugc** t eago r.c tea 240 

gagccagccc ccgcctacca cagaatcaca cccaccabta ccaabttbgc tttgcgbttg 300

tataaatfagc tggcagcaga cgcccccgga aac6tctr.ee tctcgccagt gagcar.ctcc: 360

accaccctgg ccctgctcbc befctggggee caagctaaca cctcagctct gatcctggag 420

ggcctgggat tcaacctcac agaaacccct gaagecgaca tccacca-ggg cttc.c <jg$.qc 4B0 

Cbcctccaca cccttgccct gcccasjcc cc aaacbegaac taaaagtagg aaactccctg 540 

ttcetagaca agegactaaa geeteggcag cactattbgg acagcatcaa ggagctbtBt 600

Qgagcttbtg otbttU--ttj'> coauttcnua gaLtctgtta caacbgggag gcagatbaat 660 

gaccacttga gaaggcaaac ataegggcaa gtcgtggact grctcccgga gttcagccag 720

gacacgttoa fcggt.l.ttfcfifC caattttCatc CtvLtcaaag ccsngtggaa gcacccbttc 780

agtcgctacc agacccagaa gcaggcaagt ttettbgbgg atgagaggac ttcbctccag 840 

gtccccatga tgcacc&saa. ggaaattfea^ asfcttoetet atunor'agga tttogutfcgc 900

actgtcctcc agatagaaba cagaggaaat geettggege tgctggtcct cccbgacccg 960 

gggaaaatgs agcaggtgga ggctgetety cragi^aes.D'* ccctgagaaa ar.fjggf/ocMia 1D2D 

ttgctcctgc ccaabctgbt ggatttgeae fctgccaaggb tttcaatbbc tggaacafcat 1080 

aacctggaag acatacttcc ccaaattggt ctcaccaaca tactcaectt agaagutgac 1140

tbctcagnag tcactgggca gctcaacaaa accabctcca aggtgtcaca caaggegatg 1200 

gtggacatga gtgagaaggg gaccgaggcc ggggctgctb caggcctcct ctcccagccc 1260 

r,-catctctya acaccatgtc agn<:<?eacat geccacbfcca acaQflc-ettt ccbcttgctc 1320 

cttbgggagg bcaccaccca gagebtaetc ttccbgggaa aagtbgtcaa cccagtcgca 1380 

gggtaaceat ggtgggesggn caggagttat ctb.^bc'ttab crtggaccaa. acagni-agsc 1440 

cagaaccag/c ctgcatcctg gggcttfebat gtggtbcagb taatcagbgt gecaagatte 1500 

taabaaagct gaccbtgggt tctgbgaaaa aaaaaaaaag aacaaccaac gcebaaagtt 1560

gcggcgtaaa ggatgctgaa agoaaatacc ccgnbttgcc catccaaatt baacatoagg 1620 

ggggggcegg cagtacagaa ggtccctcag agtgtgtttg atggccgtcg actgattggc 1680

gggggggtrgg aga.g*g-tttt ta c gc ate r.f. tgi:«<:<j bgtt atfCncccygij adOatfUtfiigc 174 D 

ccabaggacc cccggagcbg gaggqttcrtt ttgcgccggc actcbtgggg gggbgtttab 1800 

gaccgacttt tagtggcaca aactgtgggg cttctcatat gtgtgcrgcg^ gcttcrtgtt.t 1860

<210> 52
<211> 550 
<212> DNA

<213> Homo sapiens
<220>

<223> Incyte ID No: S208O01CB1
<400> 52

gtfsgggtcfgc bggaggftca« ggtgtaggcc ttgcc^gotgi. ct£jflgttoi5t cgcoaet[j«i 60 
ggagcaggac bgbtbctgga aacactgggc tgctggtgac ccgbcaaact Cctcccacaa 120 
attctgaatc cgagaaagbg aaggaaggat ggbgggg^ag tff^ggaggeft g^agcesgaca 180 
ccacaggqac ogaccagaga bgcggtggag acagoggagc btccttcbca ggctgtttcb 2d0 
gggaagcctg agaggegg-cc agcaccaccc tccgct.caet ctc.r.cct-CBQ •cc'ttt.r.oijfCit: 3 0D 
bcccttcLca acccbttcbc fcccbccbcgc ctctbccctfc bctbgcbgtc btgttagtcc 360 
cbgtcccaaa acbccbggct cctbbgtbct gctgccgtgg cccccaccca ggagg^gtftc 42D 
tcaggcat;^ tuccccccac gauuystc^a cacnacaggfc tcabgcbggg gcbgggggag 480 
ccccgcbggg btcctgabgc cttgtgcaca gggaagtgct gcagtcactt btggactctc 5dD 
cbgaatgtOt " 550 

<210> 53 
<211> 2597
<212> DNA

<213> Homo sapiens
<220> 

<221> misc_feature
<223> Incyte ID No:
gttaatsbtc ctgctutsag asactbtaaa gttbcaaata cccfc&Gabgc tb^acl-utcc 660

atagtggatb actatgagcc aaggngacag gcggtgagaa gttacaactc tgaagbgaag 720 

ctgtcct.ccC gtsracotttu c*gtga.tgr.c c.&ggqctgcc gtccttytga ggatggagcb vbq 

Icaggcbccc atcatcacbc ttcagbcstt bfctafcttbct gttbcaagct tctcbacttb 840 

atggfti^ett fluotgbgfttt r.etrttctaaa ggfscbctgtg Caacacteac atttccagta £00 

gtcacabgtg atbgtbttgt ttbcatagaa gaabactgct tcbabbttgo aaaaftsjacbb SCO 

etttttcttt etatggggtt gcagggatgg bgtacaecag gtcctagcat gbatagctgc LQ20 

atagabbtcb tcaccbgabc ttbgtabgga agatcacaat ^ttatccngtU iftflty tctat L080 

attttccccb ctcaaaatct Ctbagaattt Cttbggaggt gtctgctctc tccagaabaa 11.30 

aggbabbacb ttagsatnas tattctcctc <*t tttulgaa agaaaknaac cbagattebt 1200 

aagcBttatC acacabccat gtbtgctbaa agabggattt cccfcgggaat gggagaaaac J.26D 

agccaycacg aggag^trtca tetjSttccct bccoacctce i^OD 

<210> 50 

<211> 2241

<212> DNA

<213> Homo sapiens

<220> 

<221> misc_feature

<223> Incyte ID No: 3B83861CB1

<400> 50

gctccccgag ccgggctgca ccggaggcgg cgagabggtc gcgcgcgtcg gccbccbgct 60 
geixtjccctg t;*grtgi;ta.<! taCDycru^.a. etfbguactfcc i^gccogoDs" agcgt: cfgagg "I2D 
ccaggagctg cgcaaggagg cggaggcatb ccbagagaag tacpgatacc tcaatgaaca 180 
ggtcnejoaea gctcccacct ccactegatc cage gab gee ateagagegb ttcagtgggt 240 
gtcccagcta cctgbcagcg gcgbattgga ccgcgccacc cbgcgccaga tgactcgtcc 3 00 
ccgctgcggg gttacagaba ccaacagtta tgcggcctgg gctgagagga tcagtgactt 3 60 
gtttccbaga ^aCMg-y «c aaat^auycg taagaaacSC- tttscaaag<~- aagg"t»ai>eta 420 
atggtacaag cagcaccbcb cctaccgccb ggtgaactgg cctgagcatc bgccggagcc 680 
ggcagttrrgg ggcr^CQtgc gcgecgcotb ccagttg-tgg ageftftCsg-tCt cagegctgrja 540 
gttctgggag gccccagcca caggccccgc tgacabcegg cbcaccbtct becaagggga 600 
ccacaacgat gggcbgggca atgcctbtga tggcccaggg ggcgccctgg cgcacgcctt 660 
ccbgrc-cccc eg cEjgrjgaajj cgcacbbcga ccaagaLgag cgc-tgsrbccc Icagccgcco 720 
ccgcgggcgc aaecegtbeg tggbgctggc gcacgagabc ggtcacacgc ttggcctcsc 7 80 
r>v&ot.W}<w ijiyQccijr.qrt} cgcsttiwC^gi: rj<:r:ct.ar;t.ao aatfay aot.gg gcogcgacsgc 84 0 
gctgctcagc tgggacgacg tgcbggccgt geagagecbg tatgggaagc ccctaggggg S0O 
ctcagtggcc g C ccagc C ec caggaaagct gttcactgac tttgacfarct gggact c-eta S60 
cagcccccaa ggaagscg'cc cbgaaacgoa sggccctaaa tactcjccact ct tc-c b tcga L020 
Cgccatcact gbagacaggc aacagcaact gtacabtttb aaagggagee atbtctggga LOaO 
ggbgecajjet gatSa^euteg Ucto«oa?jcc ccatccacbg caijoaaaoat gggtcgoact LI 40 
gccccccaac abtgaggctg cggcagbgbc abtgaatgab ggagatttct acttcttcaa L200 
*4fi3g!30tcoa tgrtgrfagg-C tocgygaccc caatSCCAgtg Lgijjjgt.ot.ocs r<atagctgtg T^fiO 
ccgggcaggg ggcctgcccc gccatcctga cgccgcccbc tcctbcccbc ctctgcgccg 1320 
ccrtcebcctc tbcaagggtg cccg^tacba ccjtgcUjgcc cg&gggggac tgcaegtgga 
gccctactac ccccgaagtc tgcaqgacbg gggnggcabc cctgaggagg tcagcg^cgc 144 0 
cctgccgagg cecgatggcb ccabcabcbt cbtccgagac gaccgcbact ggcgcctcga L50D 
ccaggcc-aaa otocaRgcaa ccnccboggsj <3cgcbggg<?c accgapctgc cctccabcsca 15^0 
ctgctggcat gccaactcgg ggagcgccct gbtcbgaagg caccbcctca cctcagaaac 1620 
tggtRgtijct oCcagsaoaa natcaLgttc cccacccc<?g qagcacaacic ccfcctLasaa 1&8D 
gecbebgagt ccctctgcag aagaceggge ageaaagect ccatctggaa gcctgtctgc 17J0 
ctbtobtcct Cga«y«aCco esgcattgtct LLctctgbcc ccaocacat^ a^gdtyuuog IBOO 
tgggabcaat cctaggaaaa gcaaaaaagg gccccagacc cctbggccct tbcctccgag 1S60 
gact tc:tRf,c c bco c^agu° vbttyttt-ct tcBBctaaatf utacastbc"; tbtcrtitisagg 19 2D 
Caacagcact gggatccaag cagggggatg aaaaactcag cagagaaabt cgagaccabb 1980 
LtgcR/*yact gtaoccttt;b t oagga c-n couby^Qtca gtbctlyaaa aaoogtsxci 2040 
tatctagtca gaggccccac ccccaggaag catggatggg gatgaaggca caggcgtcbc 2100 
caacctcaga ggcot^LLtct ogcgtoaggw cacagagbga gaggga?incb ^atccasoco 2160 
baccagtccc bggcttbttg tctggggctg gaataaagag gtgccttcag cbggbgggcc 2220 
gagaggc-ogi} aa&aaaaaaa a 2241 

<210> 51

<211> 1860

<212> DNA 

<213> Homo sapiens




tgccc&gugg eigcbgacgag cagaggagaa gcgbccctbt ccngatgcbb cbgnbgctgg 7 60 

agaagatcca ggacagcegy cagaaagcag tgcggcccct ggagctggcc tactgcctgc 840 

agaagtgc™ c^bgcccbtg btbg<^caae atg*r.rjctg<> eeaaetfltac rtcaaactct &D0 

oqaacctgat taaggaccag atcactgatg bgcactbggt ggagagacbg caggccirtgb ?G0 

Xtacsatccq ggbgsagga* tocttffaCtt gcrgttgsctg tgccaLggsg agtagcagaa 10-0 

acagcagrat gctracccCc ccactttctc tttfctgatgt gyaCtcaaag cccctgaaSA 1GSP 

cssotogaran cnccecsfcae Cgcttcttrc agcccaggga gttatcaagc aaaagcaagt lldO 

gcctrcgtga gaactgtggg aagaag-accc gbgggaaaca gutctbgaag c tgaec ft*t t 120& 

tgccccagaC tctg^aaCt: ca^ctcatgc gattctccat caggaattca cagacgagaa 1260 

agatctgcca ctccctgtac btcccccaga gclfcggabbfc cagcetmat^ cttccaatgs U2D 

egctfagagtK ttgtgatgct gaggagcagt ctggagggca gtatgagctt tbtgcbgtga 1380 

tbgcgcacgt gggaabggca gacbccggbc attacbgtgt ctacatccgg *atgctgtgg 144 D 

abggaaaatg gttctgctcc aatgactcca atatttgcct ggbgtcctgg goagacabcc 1500 

agtgbaccba cggaaatccb aactacearst g^ugaaac bgcat-.atet.t ctgstttaca iSbD 

tgaagatgga gtgctaatgg aaatgcccaa aacctbcaga gabtgacacg cbgbcatbbt 1620 

ccatbtccgt bectggatct augga^tctt ctaauagaCt. ttgeaatgag gagsagcatt 1660 

gttttcaaac catataacCg agccttattt ataattaggg atattatcaa aatabgbaac 1740 

catgaggccc ctcaggtcot gatcagt.o&g aabggatgct ctcaccagoa gacccggcca 1800 

tgtggctgct cggtcctggg fcgcfccgcbgc tgbgcaagac atbagcccbt bagbbalgag 18150 

cctgtgggaft otttraggggt tcecngtggg gagagcagtg gcagtgggag gcatctgggg 1920 

gccaaaggtc agbggcaggg ggtatbbcag tabtabacaa clgcbgtgac cagncttgta. 19BO 

tactggctgii atatcagbgc tgtttgtaac ttttcactbt gagaaccaae attaatbcca 2040 

tabaaatcaa gbgtbbtgfcn acbgctabtc atttattcatf i>aa*t*LEr.a. ttgatc^bct. 21 0O 

ctcctccaca agatagtgtg ataaacacag tcatgaataa agttacbtbc cacaaaaoaa 2160 

aa ~" " 2162 

<210> 48 
<211> 578 

<212> DNA

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 3097301CB1

<400> 48

gaaaggwca ccbctfftcet. Qgbtgagtfgt cbecagtfgtL ctr.tgggg^c cgaggaattt b0 

cagacctgaa caatcaggtg gacaacfctcfc tgtgactbtg ggtabbcbga aabbactcab 120 

ottffsct^ca ctgtgcctgg atgtagctgt gCgttttgtg tgtatcctgt tcatggrcaa 160 

tggtggcaga gbctacabag aactabgcbfc cgtggtgfcbc tggggaaaao titCtcgaott 240 

gttggctata ctatbcaata tggctgtaca gctcattgtg cctccgaata cgctggtggb 300 

gttabcabgg bgccaatgtjg Lcabgbttgg- Uagaaggtg aeaabiibaca gaattct^ca 360 

gattccaggt gctatggacc tattccatat ggactaataa gaggacgaab ctbctbtaag A20 

abttggcltc bgasStfiaCtl; TggfttCttca ogtgct»gc« uf.ftfatygcca cagatbttct 460 

gatgattagt aagcatttat tcfctbcgact tgabbattgb ctcctbttca bgbgcsattba 540 

LLacboccgt tgaaaccigtg tactraciiaa ta^actet 578 

<211> 1300
<212> DNA

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 365B034CP1

<400> 49

agggaatccc aattatgagg cggctaagca ggcaaagaaa tagcttgggb ggttbtgcat GO 

ctacbcngga taccacbgtg gcLtlaadgu ctgbcbgaaC tr.y(?6nji^<!t; Hiar.gaa taca 12D 

gaaaggRcaa atatccaagt gaccgtgacg gggcctagcc caccaagtcc tgtaaagbbt 180 

ctgabtgaca cacacaaccg ctta^tcotb c-agacagcag nRcbtgctgt ggt^cag<:<:a 24 n 

s^gg^atjcta atatttccgc aaatggtttt gga^t.tgcta tttgtcagct caatgttgta 300 

tataatgbga aggcttc-tgg gtcbfcctaga agacgaagab ctatcoaaaa b^agaagcc^ 1^0 

ttty^ttt--ag abgttgct.gt aaaagaaaat aaagatgatc tcaatcatgt ggatbtgaat <120 

gtgtgtacaa gctttbcggg cccgggtagg agtggcatgg cbcttatgga agttaaccba 480 

tta*gti3gct tcatggtgci^ ctcragFiEgca atttat^tga gcgagacagt gaagaaagcg 5d0 

gaatatgatc abggaoaact caacctctat ttagatbcbg bRciabgaaac coagttbLgt 600 




atatttatca ar.^asjgcctt ttQctc:ctt.o a»aasaa!vi» a 
<210> 46

<211> 1910

<212> DNA

<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte ID No: 2323R17CB1






<400> 46 
gtactccgcg 

t ccaagaagt 
ctggtaa-caa 
ccaaaatttt 
gtgatgttgg 
taaacctcab 
ataagaatcg 

actgtgatct 
oaattaagijc 
fcccatgtagg 
atgatstaaa 
tgcgctacct 
atgbactgaa 
atgcaggccc 
tgtcagtagt 

gtatagcata 
agccatttgb 
attabtattg 
ctcttgttcc 
bet tatty 
aagttacttt 

ggggcccggt 
tacaggcgtc 
atLttactcg 
tgttggaaat 
tfacgtbctcu 
ggtagtggtg 
ttggtctegt 



cgcagccctg 
tetgtgtatg 
ggatgaagaa 
ttgcttagtg 
aaagaattgt 
tgcatactat 
•ffLtt«.»tg»t 
gactctgaaa 
ttfctyuaa Ug 
gggaatgcaa 
^ataaac: eta. 
cctcatgttg 
&aacagtggt 
tgatgtcasjc 
aagcaa-cact 
aoactabctc 
cagcaacaac 
cactttctc't 
ttcaga.ctta 
ggtagatgga 
gacatcaact 
agtaggtcaa 

tttcaoatgt 
aaagggggcg 

cccttogggg 
tcggaaagag 
tctgtc-cccis 
aagtgcttgg 
tatotjgettij 
ttgcacctgt 
agctcgtgge 



gcggtccagc 
gaaeaatccc 
attaaaaagg 
ccagtagaaa 
ctatatatba 
gc Cgcgaaac 

tacccaagaa 
Ctgoaanttn 
agfcgtgatag 

aaagaaaafcc 
atacaacByt 
tgcaac-aaaa. 
accctggaag 
ay tgaaactc 
atagagggag 
catatctaca 
attcaaatgg 
cgtgtatatc 
batggagaab 
cagccatgaa 
sti.aatattc 
gtaaaagata 
grc-ccgatta 
ggpttaacca 
aagtcgtgct 

tggagtctcg 
gaattggtcg 
atttatattg 
ggttgttgtg 



gcaaatctcc 
agaBaattaa 
ggctagcagc 
gagttacagg 

tgctccagaa 

tgactggaaa 
fa.1. r.cat.«at t 
catttgcfcac 
tactgtacgg 
actgtc-ttsb 
Latgtgatgc 
Laacfccatoa 
Caatagatct 
ttacttca^a 
aaggacttgt 
cctggggaaa 
gfctgtcbaaa 
Ctgcagaagt 
cttatrjaeca 
aaagtagctg 
tatagcctat 
ataactgtat 
gtgagctcgt 
gtutttc-^ijn 
gtccctgggc 
atagtsocgc 
ctcggtgttg 
tggccccgct 
■gtcccbc-cy c 
ttcctcaacc 



aaaatctggb 
tccctttata 
aysaatcacia 
tgaagatttc 
tyXbygatst 
acaacttaac 
attgatt^t. 
caaaattgaa 
sgsgB^atta 
agtactt\act 
tgaacaggaa 
tgcacbacaC- 
actgtatctg 

tauaal'-tfStg 
ttcctttaac 
caae&ggagt 
tgc-actt tea 
caaatctgat 
accagaraat 
ctccaatggc 
ct<:ntotaat 
taaaataatt. 
ttCcttttat 
toasctbtgt 
ccecgcgggt 
caatiggtggt 

tefctefctctg 
cacgcgtgtg 
gaegbaggta 
tcccgtggtt 
cocccgggUt 
cacacacaat 



ttgeatatae 
tt.a&acattg 
tggattcttt 
aaccttc-ta-t 
ctcatttact 
e.aa gtgc r.ac 
aatRoaggtg 
gatctgggtg 
caaaac-cnag 
gagtctacag 
atu tstaaoc 
aacagtagee 
tatteggctg 
agaabagaaa 
ctteaagegc 
caatcaabga 
gaggctacgt 
a cagatc tgg 
cttaaaaagc 
gi^eggtttcg 
ttcatacac-t 
aaattaaaac 
at*iactybca 
atttactccc 
gtcctrj tata 
gteggaggge 
gtccgtggta 
gabcctcgca 
gtgttgtcga 



60

120

180

240

300

360

420

480

540

600

660

720

780

840

900

960

1020

1080

1140

1200

1260

1320

1380

1440

1500

1560

1620

1680

1740

1800

1860

1910



<210> 47
<211> 2162 
<212> DNA

<213> Homo sapiens

<220> 

<221> misc_feature
<223> Incyte ID No: 27549$DCB1



<400> 47

gaeagegcaa 

(iaaftaa&aaa 

gggaagctcg 

cgctgt&agt 

eggggecaga 

tcgtgcctgg 
caggcagcC^ 
tgaggeaaat 
aaaaga»ga« 
gggactaccc 
ccttaatLcs 



ccgcactcca 
aagegaggge 
ggceggcagg 
t.T.cqctttcc 
ecaaggeggg 
g^aa« tca^^<^ 
ctcacataag 

ctgtcagtcc 
asaasacttije- 
tcatggcctg 
y pbatl-cgta 



gcctggacaa 
tggaegtc-tg 
gtttcwege 
etccagtgga 
cccggagcgg 
6g<^!^g^^gg<:f. 
cgcttcctgg 
sjtcttggagt 

atcctggcbg 
aacabgaaga 
gttggcttac 
aLaaal^tgg 



cagaaUjaga 
gagcaggtgc 
acgctyoc^c 
aaacgaaagc 
aac-tbeggtc 
ggar:gctcgc 
aagtgaagtc 
gatcatgaat 
agtcctcgcQ 

acaacattgg 
aott£ar;cBg 



cbccgtctca 
gcggctgcaa 
ccagctcccy 
tgggegggge 
ccagcbcggb, 
atggc-gcttg 
gtgctgtcct 
gag^aaggcg 
gtccccggca 
Ljagagagcgt 

ccagacctgc 
QatAttgiiag 



»aaa»aaaaa 
cggcagccgc 

gccacgagcg 
ccccQgctCa 
agagattcca 
gaacgeggge 
tctgggctcc 
gatcttgaag 
ccc-agggect 
tgcctbaacb 
aggetcaegg 



60
120
180
240
300
360
420
480
540
600
660
720

ggcagcgata gaCttbanst gsggaflgtcc tccbtacgga abigg^gcca acbttucr.ctt 1980 

gnacagctgg atgagcaaga tgqtgabgca gaacaaagca acggaaagat gaacggtagc 2040 

arctta&aCa *«a* tgaato naaggaagaa. agaaft.agaa.tf aggaggaatt aaattttaat 21 DO 

gaogabattc tgbgbccaca tggtgagtba bgcatabctg aaaatsaaag aaggctttflt 2160 

r.crtitaagagg cltifgagcaa acbgcagcag tacbtcccaa aggctcctga gtbtccaagt 2220 

tacaaagagt gctgttcaca gbgcaagatt ttagaaagag aaggifDaaga Raatg3e.r*<:e 22P0 

ttacabaaga taatCgc&aa cgagcaaaag acttcbctcc raaatttgtt ccaggataaa 2340 

aacagaccgt gtctcagtaa cbggccagag gatacaaats tccbctacal cgbgtctcag 2400 

ttctbtctag aagagtggcg gaaatbtgbt agaaagccta caagatgcag ccctgbgtca 2460 

tcagtbggga acagbgctct tbtgtgtccr; cacgsag tfct; r.caty tttaC a Lbtgcttcc 2 57 0 

atgace&aag nag&btctaa acbtabagct ctcatatggc ccagtgagtg gcaaabgaba 2 5 SO 

caaaagcbcb tbgtbgtgga tcabglaaCt aaaatc^ctf* gaattgaagt gggagatgba 26a0 

aacocttcag aaacaragba tattbctgag cccaaactct gbccagaatg cagagaaggc 2700 

ttatbgtgbc agcagcaaag Btfaccbgwgt g*a.tao6Ctc aagccac^fc.b ctatgbccat 27 6 0 

aaagtbgtgg ataataaaaa ggtgatgaag gabtcggctc cggaactgaa tgtgagtagt 2620 

tctgaaacag aaaagyac^a ggaagaagtt aaaccauatg gagaaaBsga tccagatttt 26S0 

aatcaaatca tgcatgcabt tbcagfctgct ccbttfcgacc agaatbtgbc aattgatgga. 294 0 

a«aatbl,l,aa. Utgatg&ctg tgccar;cct& ggcacccttg gcgtcattcc tgaatctgtc 3QO0 

atttbattga aggcbgatga accaattgca gattafcgctg caatggatga Igtcafcgcaa 3O60 

gttr.gt&tgc eageagaagg gbttaaaggt actggtcttc tbggacatba atctbtgaat 3120 

acbtgctgac tgctaagaaa bgacc-agagg ggaagaggaa ttttfacatgt tatfggiiatta 31SD 

aagcaaaggt ggatttaaga abtaaaccat tacabgcccc tbccaaaagg cagaaabcca 3240 

ttcaaacatg nctgtccuaa atirccttata tOaaataaag cagfet bgc*o r.gftt.ggacat 33 DD 

cagacttgaa ggaaatgtbt ccaatttbat atttaagggg ggtggtgggt gggagggggc 33 60 

aagtaaagac ggaa^aagtt tatftwjcagt aatagtaaab catgbttaca tatgagabtb 3d20 

atagbcgbgg gaggggaaba aagbtctgtt abatbtcctb gctcgagtbt cataccagab 3480 

WjU-QijCOC aCRRwcrgabt gT.fl. b craegta gatgggacaa cattctgctc tgaacgaaaa 3540 

gtaabttbag agacataacc bgcbtaccaa tgcctgLctb tgatbcatat lcbar?tttca 360Q 

ataaagcatg aaagtgaaga actbgtaaaa aaaa 3634 

<210> 45 

<211> 1661
<212> DNA

<213> Homo sapiens



<221> misc_feature

<223> Incyte ID No: L973B75CB1



<400> 45

ctcgagccga btcggcbcga ggcggactgg 
tggcccnsrcc accbgggacc gboctgggga 
caaagttttg gcggaeccat cgctggggcfc 
Bcccg-atycc yggoggOOCfg asccabbgac 
ccaccccgcc gccbccggtg cabggggact 
ugij^gacatscs gccgggagag gc^ggcagcc 
aabgagcccc tgaagaaaga gcggctfcaag 
cagctgcgga gcaaacggga tgagtbctgg 
gagatcbggg atgcccbcaa ggctgc<-tscc 
gc<:c:aggcca tcctggatgg agccagcabc 
tacgatgagc fcgggcaatcg ctaccagcbg 
ctgctgctgg agcacacgga ggaggagagc 
cgccgtgagt trccgctgaa nntcjcgcccg 
agcctgcccg acacagtggg gcagcbcaag 
ccabccL^gc: AnOgnbguLt cl'.tctccfjgg 
gagaccaaga tccagaaaga ttbtgbcatc 
caasactifat anflcccacgg «<;i?cccgggft 
ccctgcbgct gccttccagb gctgbcattt 
gbgggtgagc cgtoaagsjca ccct-SICCttt 
tccegggagc aggcagccac acacgggcct 
gccctcaccc tgccbgtcbc ccgaag^aiiii 
ctggagccca gBtcCgtcst cbggbgctgc 
ggtctgaagc agctgctgtc- tcccLccLct 
agtaccar.tt: ccbtggaagg gagggaacca 
ggcgggccbc tgccbctccc tgccc-ccagc 
cgg^t.ggg^r. (/ttgCggcag ggr.cbgcaca 
gggaggbcbg cagcctggtc abggcctcca 



tctccggccg ggcaccgtcg cgggcccccc 60 
gtottfccnet cccotctcte ccctggcecg 120 
gagcgcgccc ccggggggag atcggggagc 160 
C-cgggacgcc gccgtccgcb gagcagccga 240 
ggcbgaggag -ccagcatuog caactgcgbg 3 00 
ccgggacacc -cccgcaagcg agcaggacgc 3 60 
bggaauagcg actaccCCat uactga^gga 420 
gacacagcgc ctgccttcga gggccgcaag 480 
tatq-cCgcts awcjr.caacga ccacu^sc bu !>4 0 
accctgcctc atggcacccb cbgtgaatgc €0-0 
cccatotdtft gcctgtcflicc fjc^jbgaac 
ctggagcccc ccgagcctcc acccagcgtg 720 
Cc<:acgggc6 aggacgtgag g<Jteagcgcc 7ft0 
aggcagctgc acgcccagga gggcafccgag 840 
aagtrtgrr.rft ragaccgcac «<;gg<i; tccag &0D 
caggtcatca tcoaccagcc cccaccaccc 960 
a.yaggcKosg ectggagcac taggccccca LO20 
tcttcagggg ccctccccbc ggtgbggctg 1080 
ot4gugc«<:ta c:gcgccRcca gtt«CL-ggta 11 40 
tgcaaccbtg tcagagaaaa ggcgaacagg 120O 
fc tcnaflcoac aagugwrtac oasg^ggntrc l?f>0 
cagctgtgct cactctggbt ttctgcbcag 1320 
gcccccafccc cct-ngctubc occbqggcao 1350 
cccgbgagcc ccagggctbg ggaagcctga ld40 
acaabtq<jca Ragabgaggc ygg tag tgua 15HO 
gggccatytc cbggctgtaa cccaggcagt 1560 
c-agcay tjtcc ctqbgaacag acabatctijc- 1520 






<400> 43

gcaaaaatbg bbtcaaaaaa bttgtgagga btbbgabtcc tatcbcgttc aaattt-tl-gg t>Q 
tbcatcgtca cctcaaccta gatbtgggat gtcaggggaa gaacfccagtg aatctactcc 120 
agagcobcaa aaagaaattt ctgagtcatt gagtgboacc agagaccagg abgaagatga 160 
taaggcbcca gagccaacyt gggctgatga bctgccbgcc actacctcab cbgaggccac 240 
cacoacocec aguccactgc bcagcacccc ttftg flat egg g^ga^gatc cbagatgttt 300 
ggaagccttg aagcctggaa actgtggbga atatgtggbt cgabggtabt abgacasaca. 360 
ggbcaae-tct tgtgcccgat tttgctbca.0 taaiit.gtaar. ggcbeagoa*. ebagattcaa i2D 
cagtgaaaag gaatgtcaag aaaectgcat tcaaggataa gcaagbaaat bggcctcbct 4S0 
cbatcoaaag oatay C v cct&AUlcc aca tattle ccaatacaaa tacagcacta 5<10 
tatttgagtg tatactgagt atttacaact tabacatgba atbgaattct cacLacagr^C 600 
Cbaggatgta catatbatta accartbata taggteagaa agebgagget ctgagaagtt 660 
tagtaacttg tcaactgtca cccaacbaaa aagttbcaga gcbgaggabt tagatttisga 72D 
gctsjbgtaac ttcaata^ai': agactctatc tacttcacaa ectgeaatgt gattctgatt 780 
ectttaatbe ctgbtqtabg tactatgfcca gctcaaaccc ctaccccbgt ccctqccctit 840 
acct.cr,-a0CC! aotc&cctcc ctaaccbccb tatgtccctc acagtagcaa gatgtaqgtg 9 DO 
abaggaagga cttcggtgbg agaatbagaa atgabgtaaa tBtttacgca gr/agtgctgg 9b0 
gataggagtc gggatggtga gggtagbtag atcttcgcct cacttgccct gaaagtggta 1020 
atagggagaa accaabctga atbaoaatba uLttsaatgta tcacagsctg eca<:ttbgta 1DB0 
Ltcccccasc atgttbggta acaagtgete aatgtatgtt aaaataaaga aggfcttttab 1140 
acccbtccat taaa«latgt ^«otQgg-iOC ttccatttr.* tggagtggaa bgggaaggee 1200 
cbtgacagcc aggaaccact tgaagctggc abccactctt gaacagagag tabtaaauac 1260 
a.gg«ttcaca ctgaaaagbg eg^ea^caaa ttgeaatgcg agtaabggag agbaaeggag 1320 
aatcgctbga cccbgggaag cagaggttgc agaaogcegg gattgcccca y 1-^71 

<210> 44

<211> 3634

<212> DNA

<213> Homo sapiens

<221> misc_feature

<223> Incyte ID No: 19 L1&DBCB1

<400> 44 

gcggcggcag tggcgcgcac aggbgattga ctgqccaticb Bccbgaagga cc^ic^ass t o 60 

ct^r.r.gotg gcaggtggcg aagcccabtg gggcggcggb geagacrgeg gcggcggcbg 120 

oggcggtctg getegggasg cgtbccLggsj gccoacdCoa tyscccr^cg ^otgcagcbg 18D 

gagaagg-egg cccggcg-c-tg ggcggagacg gtgcggcccg aggaggtgbc gcaggancat 24 0 

atcsapaccg i?ttaccgc»t c t uyer.gg&ci cesctgeatto gcsggL-gtgtg cagacgBaac 300 

cgcaaaggaa atecgaabtg cttggttggt atbggtgagc atabttggbt aggagaaata 360 

gatuanaata [fttttcataa catcgatgat cccaactgtg agaggagaaa aaagaactca 420 

tttgtgggcc tgacbaacct bggagecact fcgttatgtca acacabttct bcangtabgtf 480 

tttcteaaet tggag^tteg gcaggcactc tacctabgbc caagcactbg tagbgacbac 540 

atgctgggag acggcatcca agaasjaaaaa aattatgagc ctcaaacaat ttgLgsgcat 600 

ctccagbacb tgttbgcett gttgeaaaac agbaataggc gatacattga becafceagga 660 

tttgttaaag cctttrjjncct gtfwtactsffia r.^acssgcagg ^r.tjetc^aga attctcaaag 720 

ctctttatgb ctctatbgga agataebctg tcbacccaaa agaatccaga bgtgcgcaat 78Q 

attgtbeaac ageaabtd-o tggagaatieb gectabgtaa. ttgttcgcaa ccagtgbggc 840 

agagagbcta agctttbgtc aaaattbfcab gagcbggagt taaatatcca aggecacoaa 90<} 

cacbta<ic«g ftttutnCct.r gyaattcttg aaggsagaaA. aebtagaagg agacaabege 960 

tattttbgcg agaactgtca aagcaaacag aatgcaac€ia gaoagatbeg acttcttauc 10^0 

ctLccbUuca ctotgaactt gcagctaatg cgtttCgtct btgacaggca aacbggacat 1080 

asgaaaaagc tgaataccta cattggcttc tcagaaabfct fcggababggs gccCtaCgtg 1^0 0 

g»»C(*toii^g gtgggt<:ete cgtgtaCgaa ctcagcgcag tcctcataca cagaggagtg 1200 

agtgcbtatc cbggccacta cabcsjcccac cbgaaagatr; cacagb-cbgn tcaatijgtfet 1 260 

aatJCttaatg abgaa^aeat agaasagabg gaggggaaga aattacaact agggattgag 1320 

gaagabctag cagaacctbc taagtcfccag acaegtaaac ccaagfcgbgQ caii^USaaaC 1 38D 

caLtccbr.t.c gaaatgcftta tabgtbggbb tatagacbgc aaactcaaga aaagcccaac 114 0 

acbacbgttc aagttccagc ctbtcbtcaa gagebggb-ag abegggataa tbccaaattt 15^0 

gaggagtrsnt gCattgaR'at. ggctgagatg egtsagcaaa gtgtggabaa aggaaaagca 1560 

aaacacgaag aggttaagga gcbgbaccaa aggtbacctg cbggagctga gec-cbatqag 1620 

ttbgtctrtc CusjaabgcoL cs<!aaaagtgg r,tr/c2Attfaat ^ttcacctftc caaacctatb it»SD 

gataabcacg cbtgcctgbg ttcccatgac aagcbtcacc eggabaaaat abcaabtatg 1740 

aaqaggatat cbsaababgc agotgaiiaCt l-.tctatagtn uat a tggaug aggtccaaga IBOO 

ctaacbgtga aagccctgbg taaggaatgt gbagbagaac gbtgbcgcab abtgcgtctg 1&60 

aaaia<?c5ac Laaabgaaga ttataawact. ytr.aat&atv tg c r,gaaai}<: agc-agbaaag 19 20 




tgbtttaaca ^aagagtgtg aacattttta ttttaoaatt taggcaaaag tcactatcaa 

atggttgctt atttgt t :f.ea cacagccata taqttbttcc bagagg a tbt tgttLtgttg 1140 

ttgttaaoaa gafrtfctgctt a^gctagat gaaactttct atagaaaaaa aaaabtfjbtg 1800 

oaaqgtccag ttctoagCac catgtgagtb as teat acta <-sacta.ft C tt ot-ttctaaaa ISbO 

actoat.taat gtatr.ttata aattaccttt tcacatatgc aoaatctgtt tctactacaa 19^0 

tgttattttt ^ctaatgcct battgtca^ utetttttijs. aacatcctgt: egtgaaeats l&ao 

tgaatcaacc tgggcttaaa actgaaagcc aqttggsctg aaacatttga aabauUt^co 2D40 

ccagtaaaac cattcaatcc ataattflsita aar.esatettc taasaatggt ttLaatccgt 2100 

atagatgaca ttttgtagct ttggacacgg tggtaattaa gggccattaa ttbtacaetc 21SU 



caaggta 

<210> 42 
<211> 1826
<212> DNA
<213> Homo sapiens

<220>
<223> Incyte ID No: 12S&J90CB1
<220>
<221> unsure
<222> 1755
<223> a, t, c, g, or other

<400> 42

rn-cocgoctt tcCatageat agctcccttt aggtggaatg attcctataa gatttctcat SO 
tattaaatca tgcatttttc aagatgrr«ab caat^Ctly a tttnatctaa gctgatattc 120 
ecatttgtta gftagaaceac ccacatgcta gagagagagg aggaaatata cccacgacca 180 
cacagccagt bagtatrcay ttgct^bijo art create k& ggtgtcctgc ctcatggtag 210 
ttaaatgata tacagaaaag gtaaattttt anagaaatat ttatbaatat atbccbat** 300 
aacalttbaa a C gUia.C<:ac ft^aaetgg ttaatttttc cattccaaag Caaatgctaa 360 
gcacgcctat taatgaagca gtacttcbsa ttagtatatg Roatt-ttg**. gttaattaaa 42 D 
Ctcar.bgcac teaatgtgtc etrcttggta tagtggagga tttgoggatt ggaatabasa 4&D 
gtagagtgct tgctbangcc fcctgcttgata <4cwttt«tc tcttggggat gctggttcce 5<S0 
ggagggctgg gatatgatag atccttagcc caacacagac aagagatfcgt gB-acatitfton 600 
gtpagbccat uuaiJi^tyya gaegtatccc tarmacs tee aecsccccat gggagagabc 660 
tatgagtgga tgagagagat cagtgagaag bacaaggaag bgobgn^t:'* g^at.ttccta 720 
gua<jr<jMftct: fttgftgfl<:cce ccccatgtat tatctgaaga tcagccaacc atctggtaab 780 
cccaagaaaa tcatttggat guactsttfoa abtcaegiiCa ijasaatggat tgctcctgct 6d0 
ttttgt^aat ggttcgtcaa agaaatccta caaaaccata aagacaactc aagtatacnc ?C0 
aagcbccbba. tjgancnrbgsja ottctafcfifto <;t. tctagt.tc ttaacataga tggttatacc 960 
tacacttgga caactgateg tctttggagg aaatcccgtt caccccat«a taa lugvarn ID 2 a 
tgttltggg-a cguatubrjaa tcgaftatttc aatgcatctt ggtgtagtat tggtgccbct 1060 
agaaactgcc aagatcaaac attctgbggg acagggccag bnbcitsaacc ag&^aotaaa 1140 
{jctCfttSO^a rjotteataga gagcaagaag gaLgatatct Cgtgcttccb gaccatgcac 1200 
tcttatgggc agttaattct cacaccfctac ugcbacacca as^at.Aaatc aagtaaccac 1260 
c.KfcyaAatga tccaagttgg acagaaggca gcaaatgcat Cgaaagcana gbatggaacc 1320 
aabtatagag tbggatcgag tscacat»lvt ttaCatgcct Oittagggtc tCcaagagat liao 
tyggccrcgag acattgggaC tccctcctca batacgCbtg agctgaggga cagtggaac« 144 0 
tabgggtbtg tbcbgccaca nu<otCwj*.t.c o&QVCe&vot gtgaggagac catggaggct 1500 
gtgctgtcag tc-ccggatga tgtgtatgcg aaacactggc acbcg^acag tgctqgnnO'D 1560 
gtgacatctg ccacbatgct ijotgggiitrtrj ct.ggtg^-Ct gcatgtctcc tctctaagtg 1620 
iractctgccc aggctttttc ctgccagaaa fctcccctbta ttbttccbcc baqbo^agtc! L68D 
tttctttg»t ncbaattntg- ddattacctg tggflftCt.r tt gctgacttcc acagatcttt 17d0 
ttttctctcc ttctnCaacC gbctgfcacac acbttaabgt aabaabtbac ag»dlgttft. ].B0D 
agbgagatag gabattetCaC QCVCtq 

<210> 43 
<211> 1371 
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<223> Incyte ID No: i7b6?74Cbi



tccgtacaga, taaaggaaga gabcccactt 
t.gc t i^caga b acgaaiitgMt gtgctbgact 
ftijgacbbfctft CBggtactgc ttctctgaga 
tttbggcaca. ggaaactgtg asggccctgt 
tct.UcttiXfa tggc«bg«a!2 g^geatggga 
aagattbggc agccccagag atgccaactjj 
tccttcat.r/a aggcatctcc aggcaaggaa 
ttcctgeaac gaaggaggbg gtgcogacgt 
aaggggcgca gggtggctgt ctttgttcca 
cbgccagctc ccctgaatga tgaao*e t tc 
atgctgbccc ggcctcgcca gcccbctggg 
caagcbctgb tgctcgugag cctottistca 
ttatBgagaa aagtgtgaat tcaggtggag 
oaaycacagc tcoaagttca yaragytgcc 
cabttcctct ggagtfctgag accctgacaa 
saflttbtcge l-.r.arjgattag ttgagt.toca 
aagtctcgga tgttagggcg agaecctgca 
eoag^t-Cuct ygggcttcag ggctaggagg 
tcacabgcgg cbcaagtcac tcagaggctg 
ttacrctocta aaccacagct gttbgtgtte 
ttbaaaggaa tgataataaa cbtactbgcb 
gaagcbtntg agagggaatg ggabgabcct 
actcaagaga btcctcck^b ccctcn^ab 
caagagttaa abtgttcaca tbctagaabg 
ggataatbc* getaaatagt ggctatbttc 
tcaaaagtga bgcttcabab gcc-c-casig bg 
ggattyer^g gtttgccbtg atgcagccaa 
gtgtttcttc bfctatgtaaa tbgt!jtg«Ly 
aa 



ctoabacata tassgga^sat tcbgfcLJttgt 9DD 
cactgggtat bagl-cctgac ctgctbcclg 960 
tggc^ccagt gtgtgcggty gttggaggga 102D 
ctcagcagtja ccctcctcac aacftscttict: 10 BO 
Ltgtggacrtg cctbggccec aagtgaactc 1110 
cascatuctc ad'tgtaLtc uctgtteecc 1200 
aactaaagtc attggcccga ta.caaa.acat 1260 
grtgottctc at<:acc-fcg;ea tfc tgc t cga c 13 2 0 
gractgtbca ggctgccbgt catccccjggc 1380 
cisagcaKoec tctgcccttt itcctgtccct iaa0 
gcabtgtggg agatgccbgcv caggaatgaa 1^00 
cetrcttgga cttatbccec acctgatacc 156D 
agtaggccca ggc-ccccbga, ggcaccagtti 1620 
cttagngagg aaaaccabga caggcasatg 1680 
acaacaggtg gcatctggtg tgn:tgttctt 174 0 
gctgggtttt gggagaaagg agatgcbarc 1800 
agbbgagtat tagngagctb clctCtcaaU ISfiD 
gaggagcetg cccttttaac agaaccccag 1920 
fctgcatbtca. aggctatgtt g<jt.<:r;tttgt 19 80 
cacatatgtt gtgaatbttc cttggtbctt 2040 
ttagtf«ltty ctf-ottcttc ttccacbbca 2100 
accagttgcc tbttcagacc bgaggcbcta 2160 
r.t^r.-tgccacc attttttctg ggtgatgcag 2220 
tgbagaagcb tetggctcca gtbatctaat 2260 
agbggcaaga attataataa taaagggaag 2340 
ccacagoc^c tacgcjstrsng aUCCtttal-t 2400 
agegcgttcc agttcacccc acbcttgctb 2460 
tttcastaaa tfragtcCc:r.t tcctaaaaea 2520 

2522 



<210> 41
<211> 2167
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<223> Incyte ID No: 011344OB1

<400> 41

cgrcggtctt ggggggatgg Ctccatcatg 

ctgttggctb tiicaaaatgw cccawtc f-f- 

aattcaatta cacagtggat bgbagacatg 

gaaaaaUU-C aacttctatt tBaesbttagt 

atgtttactg gtgaaswitat tcctgttcat 

ttfctccattc taacagaaga ctggtcccca 

attattagca tgcfcttccag ctgcaaofl»« 

gt.grigaacat gtaacaagaa Cccaaagaaa 

bgatgccacb gttatcatcc tcctwu^tja 

gatcattcag tctttgaact ttaacctbtg 

acttcctttt *cbgcaCttt trtctLiitgt.g 

gtccaggcaa ctgacatgct ttbattagtc 

bcaaataCiia ttcuattttt tatttttatt 

tcatgtabtg tgcaataaca atgacttccb 

bggg^ct-tirC (*dc«uyaa^a yrtctiiacta 

agggccbatg aaatgaacbt actgatbata 

c-att»ottta &aa<j i,cacoe caactgctta 

gttaacagca tgaatcaaca tgcgtcbtta 

batbfiaattg ttC«LatPta ctgtcaagtb 

Cggcagagag gbacagtabg ttabccctab 

btcctcaatc tct^aaUtaM ftgcf C^tgaot 
tttattbtgg aagtaagaaa aatggcagtg 

gctbgbbatb tgb^catLaa ngal; l-Lbttc 

t&go^atcag caCgagcccL actgccCaaa 

ccgbaaacab tbttgtttgc aatctbgtll- 

o»tac:tttta tcgaaactct tgtcaagttc 

tcabctfcgtc cbttaatccb cjbaccctaaa 



gcgtcaabgc agaaacgact acagaaagaa 60 
ggaaCgac<:t taaatgagaB gagtgttcaa 120 
gaaggtgcac Gagatacctb atabgaaggg 1^0 
agtcgatatc ctbttgactc tcctcaggbc 2i0 
cctcatgbtt abaycasUag toatatwtgt 300 
gcgccctcag cccaaccagb tfcgtcttagc 36D 
aacagacgac e^oi^gataa tcctbr.tbat a20 
acaaaatggt ggbatcatga. tgatacbbgt 480 
agnttttff-ccr. &cbga.gaaaa tgagcacbtt 5<1D 
actggaagtg acctatziggc aatgaagact 600 
<satCttijgg^ gca.r.gttgBb cgctggtbca 660 
atacagtatt aatgcaggtg bcaggaaatc 720 
Cttrtaagct ttbggaaaag ctccaggccc 780 
fcggcggbbtt ggtacgLtca btgccggc-aw 840 
art.cctgcca tLcaatgatC aatgcatgat 9 00 
gtgggaafcat aaataaagbg agggatctaa 9 60 
tnt.ttggatt ccatgcactg tgatccCaag 1020 
aaggactgta abgaasuatc attg^ab^ttt 10HO 
gr.tttgac;at ggaagatttc caagtaacat 1140 
ggbgaaaata aattaabtbg btgtatatttfl 1200 
a«tat>4ygfjn atgfiiatggct caatcaaggc 1260 
atgattaaat tgctgcagbc cabaatttsg 1J20 
caagbanatt wcaotcr.ijtt aActtcctgc 1380 
cactatctca tctatttabg btbggaaacc 1440 
ctbttgttat wagtoanijbt. t-.Qaar.gttac 15U0 
ttccttgtaa attbtcttca ctbgtqagta 15&0 
aboasaawt.A oatttttyic? aQag^-ttea lfc,20 



<220> 

<221> misc_feature
<223> Incyte ID No: 5432579CB1

<400> 39

gtccayccgc ggcggcggcg gtagcggagg 
atgcbgagct cccgggccga gg<:gg<i'gaLg 
clgcg-yaccg gggccgccgt cagatabaao. 
attgetgctg ctcttgcagc aggBBtatab 
aagcgtagaa aaggcctbgb gcctggccbb 
tccctgcbac aaggcctgtc tgcrtgtcct 
tc-cca-gtact ccagggatca gaags?ay ccc 
ttgcaccttc tgaaagcctt gtcetgccaa 
agetgctbgt tggatgtctb aagaatgtae- 
gatgctcacg aattactcca tgbcabbacc 
oc tcgnc t en cftc^t.ttgtt tg*ugtgcat 
aaacaaacta ccbgccgcac aagagggtca 
caacatceti. r.t.e:a.tgg&ag acLr:»£bftgt 
agtcctgbtc gabttgabac ctbtgafcagc 
ggtcaacaa t tgaccctgga ccactgccet 
gatgttgbgt gtgacaactg tacaaag-abt 
gbggaacacc agaggaccac ttbtgbtaaa 
ctctgr^atcc acctacagcg S«Ls3a<jctasj 
gagcaegbgc agttcaatga gtbccbgabg 
cataaaccttt tftcaacacntf cat-taaac 
caggatgggc cgggagcccc cacaccagbt 
atttttabga atgg<rjgc<;tg ct<:c:ccatct 
tfcccctctcc cagtbgtbee cgactac-agc 
gctgtcgtcc accatggaga catgcactct 
cctbcbgcca. ggaac-ccttfl; ctcaactagr? 
gtcegcaagg ccagcctgca ggaggtcctg 
^rgcgtccttt ccaijgatgca g<*accaij>*gc 
cctcctgcaa. ggctagagcb gatggcactg 
tactgttgcg tgbgcaaycg gccccactag 
acaggcbc-ca ccbga-gagcc agccccaatt 
gttcatgcgb gtaggtggtt ctgttgtgtt 
tttta^tc«t. vt.q&t&oxuv t^ltaaaau 
attgtaatgb taggbgttcfc cagaggggag 
gatcttttaE ttttaatafcg caggcccata 
baaoggcaac aafctbagaog aoaaa^jtgcc 
cattgtgaaa tabtccttcc aggebactea 
ctaaapoaca ccLlccbttfc bctat/Q 



•aga.cggfct.be acgccbccgg bgeggetgea 60 
accgcggccg acagggecat ccagcgcttc 12D 
g t cat tgaaga actgaggagt fca bagfj tgg a. 180 
gttatbtggg gtcccatbac agaaagaaag 2J0 
uttRal-.fcba{f ggaacacctg ctt<:atgaac sod 
gctttcatca ggbggc-tgga agagttcacc 360 
coctca^nc-c Agtftttbatc ct.taa.ca.otC! 02D 
gaagtbacbg atgabgaggt ctbagabgca 480 
aaatyyr?aga tctcatcatt tgaagaacag 540 
tegtcatbgg aagabgagcg agaccgccag 600 
i?c tgg ngti agcagtcaga asbaactcc^: 660 
cctcacccca catccaatca ctggaagtct 720 
^atbtggbct gca^a.cactg tgaacaccag 780 
cbttcacbaa gtatbecage cgccacatgg 940 
cacca-cttca tctcatcaga ateagtgegg 9 00 
gaagecaagg gaacgttgaa eggggaaaag 9 60 
cagtbaaaac tagggaagct ccctcagtgt 1020 
bceagi>r.:aeg gvwcycctct i/aagoygo* L ID BO 
abggacabtt acaagtacca cctccbtgga 1140 
aacaagaacc eagggcct&<: actrjgagcrtg 1.200 
cbgaatcagc caggggcccc caaaacacag 1260 
ttatbgccaa rgctgtcagc gccgabgccc 1320 
bcctccacat accbctbccg getgabgeca 1380 
ggacactctg tcacbtaccg acggtcccca 144 0 
tiatcagtgtfc: Uytggatctc wyniigacacC 150 D 
bccbccagcg cetaccbgcb gbtctacgag 1560 
<:aggagtgca agtctgaaga. abyaebgtgc 162D 
betgeactgb ccaggaaaaa agtaaaactg 1680 
agccttccag cctbctggtg tgttcbaaga 174 0 
cacaci;a^RC vaggctcccb sa»Ctigtvct 1BO0 
aagaaagcat tcabtatgtc cggagtgbcb I860 
aaotcagHtb i-: b tgaageca ecgtcttcfib 1920 
gtacctttgb ctnatcaacg tttccactba 1980 
aaaattgttg acaagaatLa atgaaatLaC 2040 
bttcacttbc 53atbgctttt gtnccac^bc 2100 
aaggatagca agagaacagg taaabgabgc 21 £0 

2186



<210> 40 
<211> 2522
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<223> Incyte ID No: S853753CB1

<400> 40
ctccgggcgb 

geggcabtag 

ctgaaabtgc 
^a^ailCtaftC 
gaaatagggc 
oggbgnacac 
bgtgtcbgac 
afcagcabcaa 
Caggagagca 
aagatgggcc 
agaaggbggb 
cannggctgc 



gctgc-cggcg 
agctg^gifoa 
cgaggaggag 
6cggr?i'.gct3iJ 
caagaatctc 
tct^gsauat 
tgaagectet 
tuaogataLa 
tbgctgctcc 
gbtcttbaca 
tgagttbgba 
cgacaccasg 
cctctgciict 
tc-tgaagege 



gcggtaggtg 

gcggcacagt 
gcoCct^ggg 

atcttggc ag 

occgg»oclc 
btggagcgag 
aagaagaaac 
agggatgbca 
ggagatgbtt 
gaogaoaaea 
agagcaaaac 
gttaaaqaeg 
acgaccbccg 



gcgcgcgggc 
gcgc-crtCL3at. 
atgaceggea 
t^^<^ttott.gt 
gagtgaaagg 

auctcttuac 

ctcagaatct 
c«gai3Lcatt 
tagttaaagt 
ttggcbacca 
etaaagbtge 
ttgatbctbc 
cac tggaggt 
actaebfctet 



ccggcgggcg 
guagaaggay 
gabccgccbg 
iggct.br/aa^ 
acbgaccafcg 
tcigtaotogir 
caaccccabg 
LtfcCnetCaa 
bgac-cagatc 
bggaLacaca 
caaagctagc 
bga?iacaacg 
ggactggagc 
ccbtoangtg 



gttggcbbga 60 

gttsgr.tggc:^ 12D 

tggggactgg iso 

gu^c r. tggga 24 □ 

ctggatcacg 3 00 

Ct;t.gttggoc! 3 60 

gtggatgtga d20 

t tcqat.gr> tg 450 

tgbcacaaaa 540 

fctbgenaate 600 

caaggagtag 66 0 

atggtcaaaa 720 

agbgagaaag 180 

ctcttaaagt 840 



tgttgaataa tccagatata stgaijiicflsa cvrf r.r.gg/iac-t CyccegEjaob cc«gta3bg& 1050 

tgcatfyagat gabyaggaac caggstccgag ctttgagcaa cct-agaaagc atcccagggg 1140 

gatataafcgc ttbaagg<-«gc: atgtacacag atattcagga accaabgc;tg agtgctgcac 1200 

aagagcagtt tggtggtaat ccatttyctt I: tee 1 teas cs^tacatcc t c tg^tgaau' 12 60 

gtagtcaa^c Itcccgtaca gaaaatagag atccactacc caatceBtgg gctccacaga 1320 

cttcccagag ttcatcagct t^c&ycggca ctgcrce.gc*c tgtgg^tflgc aetactggta 13S0 

gtactsJC^nO tggceicttct gggcagagta ctactqcgcc aaatttggtg cctggagtag ld40 
gagctagtat gtbcaacaea ccaggaabgc agagrtbgbt gcaacaaata actga&aacc 

.rjacaact ba t geoaaacatg ttgtctgccc ccbacobgag aagcatgatg cagbcacbaa 1560 

gccagaabcc tgaccttgct geacagabga tgctgaataa bccccbatr.t gctggeaatc 1620 

ctcagctbca agaa.caaa.tg agacaacagc tcccaac-ttt cctccaacaa atgcagaatc 1680 

ctgatacact atcagcaatg tcaaacccta gagcaabgca ggccttgtta cagattcagc 174Q 

agggtttaca gaca.r.t»gca acggaagc-cc csjgsrcc t cat c-ccagggbtt actcctggct 1300 

tgggggcatt aggaagcact ggaggcLctt caggaactaa Cggatctaac gccacBccta i860 

gtgea&Bcac aagtcccaoa geagtfaacca cUgaacctdg auatcajjcajj tttafctcagc 1920 

agatgctbca ggctcfctgct ggagtaaatc ctcagccaca gaatccagaa gtcagacttc 19S0 

agcaacaact ggaacaa/XC atftgcaatug <Jn t. 1-t l". t Qaa ocgtgaagca ancttgcaajj 2 040 

^tctaatasrc aacaggaggt gatatcaatg cagctattga aaggttactg ggcbrccagc 2100 

catcatagcB gcatttctgt Btcr.r.ya&fta aatgtaattr. atttttuata aeggcl'cfcta 21(50 

lactttaaaa taccbgcbtt atttcatttt gactcttgga attctgtgct gttataaaea 2220 

aacccaatat gatgcatttt aaggtggagt acagtaagat gtgtgggttt tr.cbgtabtt. 22«a 

ttcr.cttotg gaaCftgbgag aattaaggct actacatgca bcactbctgc attbatbgta 2340 

attttttaaa aacatoacot tttatagCtg ggbgaccaga ttCtgCcctg catctgbci^a 2400 

gtttattbgc tttttaaac-a t tag tattf gtatftaaLLt atatagaata aaagcattaa 2460 

aaagaagcaa atcatctgca ctctataatb tgtggtacag tattgcttat tgtgacbttg 2S20 

gcatgcattt ttgcaaacaa tgc t u CaaSa f-ttataotac? taataatttt. at tt tat Lbs 2560 

Lata<?aatat agagtatgca catttgggac tgcattbcbg gaaacatact gcaataggct 2640 

cbctgagcaa aacacctgte actaaaaaag tgaagabaag aftaatacttt b^aagctgag 27 00 

r.atttectaa ttgtafcagaa tcttacagca tcttbgacaa acatcbccca gcaaaagtgc 27 60 

cggttagbca ggbttgttga aaatacagta gaaaagctga btctggtcat ctcbttaagg 2826 

acaattaact. gtaeagacac ataatgtaac atty tcbcoa ^ab teat tea cagattgact 2860 

gbaaattacc fctaatctttg tgcagacbga aggaacacbg bagtabacrc eaaagtgcat 2 940 

ttgcctagga cttctcagtt tctccr^abag gt.^gr.cteair aggnabt^iaa fcttCtftaalb 3 000 

Baaatgttgc fcttcactgaa aaagtgtctt gabgttbcag btattcttaa fccgccaCaaa 3 060 

aaaabagaac tatcttttgg gtttatcbgt ttbctcatgc acaggesata cacaaatr.r.a .1120 

sajar.gag ttg r.rjiagoc-ftnC C utttcbgaaa tefctttaata gLtctatbaa gaaatagtta 3180 

oatabtgbgc fcttfccagagc ctcagagaca gggggacgcg gtgccggggt ggggcagegg 'J 2-10 

aatcbgtcct ggatggggrc a.Qfctt&(S'!it« iitactggcaa c«^agat 3267 

<210> 31 
<211> 2412 
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<223> Incyte ID No: 1.J399-iaiCHl

<400> 31

2ggc?tgt!3ttf c^gcggcggc gpcggcggcc 

cagagfctgaa caabgaccat agttgacaaa 

ragaatoagc <^togc;rtEJCtc cjjagiJCag tc 

agctggggtg ctgtgbcttc attgaatgat 

gtaccbggrg ctgtagt.Ua ttcg&8tto& 

caaaaggpbc oagecctagg tgafcggcatc 

gagaagatbt gtcbtarigtg gc(5at>feaAc:t 

ggcaatacct gttbtgccaa tgcagcactg 

tgtacaatgc aagcacatat tacccaggca 

atgttcgt.<^ft tc»aty«gab gcCtfCStata 

gabgeccatg aattccttca atacactgtt 

asJcaaCaaat tagacasjaca cacccaggcc 

tacctaagat ctagagtcaa atgbttaaat 

tabctljgaLa baacabtgga gatoaaggct 

ttbgtgaagc eggaacaget tgebggegaa 

atggLbccag cttcaoagag gttcactatc 

ctgaaacgtt ttgcaaattt taccggtgga 



gagggggabg gagegagcac egagcegggt 60 
gcbtctgaat ctbcagBccc atcagccr.at 120 
tcacctggasj acatggatac aggttctgee 180 
gtgtcaaabc acacactctc Cttagqacca 740 
trjLy Lacctg ataaatcaaa accatcacca 3 00 
gcbcctccac agaaagtCct tttcccatct J 60 
CaCasJog tty uay^bg'SU'cb ccatjaatttg 420 
cagtgbbtaa ccbacacacc acctcttgcc OSO 
acatiit-cattf ^agaagsc-tL tbatatgatg 540 
ctcagbaabr ctggggacgt battaaaoca 600 
gcbasgoac-c trcgtttbgR aaaccaagan 660 
gabgctatgc agaaagcatg cttga.atggir: 7^0 
accacbcbbg btbgtcagat atttggagga 780 
tgcaagggcg bttcqgebac btt.f.yqtC?C!a 840 
gcbcagagbg bcaacaaggc attggagcag 9 00 
aactegbaca agbgcagcaii gr.yCa«aa«L2 9 60 
cabagnbect etasttgtbet Cacactttct 1D20 
aaaattgeba aggatgtga.r4 ^ \;& qoC tq&ij. 10&O 



<400> 29

gtgcttggcg cctgcgcbgg acgactcggc cggtagtgga gatgtccgge cggtctaagc 60 

gggagtctcg cggttccact cgcgggaagc rjagagtctga. gtegcggggc agetccggtc 120 

gt-&tcaa<jca gaagcgagat cgggagcggg agccbgaggc ggcgagctcc cggggcagcc 180 

ctgtgcgcgt gaagecjggag etcgagctgg cgagcgcgcg cgaggccccu gcttctgttg 240 

tcocatttat gcgggtgaag cgggagcgcg aggtcgctga ggacbcggag cct^agcggg 3 00 

aggtgcgagc aaagaatggc tgagtggatb ctgaggaccg gaggatfccgn caetgcccgt J to 

ac^ttfOacac cattaacagg agtgtgctgg acttbgactb tgagaaactg bgttctabct 420 

ccctctcaca catcaatgct catgcctgtc tggtgtgtgg caagtacttt <jaBggccggg 480 

gtttgaagto to&caccbac attcacaata tccagtttag cciaccatgtt btcctcaacc 540 

tccacaccct caagttttac cgccttccag acaactatga gatcatcgat tcctcattgg 600 

aggstatcao gb.atgr.gttg aagcccaclt tcacaaaaca g^aaattgca. aaCtttfgaca 660 

agcaagccaa abtgtcccgg gcatatgatg gtaccactta cctgccgggt attgtgggac 720 

tgaBtfsacBt. eaaggccoai; gattatgcoa feCCfctg test licaggctcta tctaatgttc 760 

cbcctctccg gaactacttt ctggaagaag acaaCtataa gaacatcaaa cgtcctccag 840 

gggatatcac gttcttgttg gtceagcgtt ntcrcagagct: giiUgaaflAatf Otctgtfaaco 900 

cbcgaao-ttb caaggcacat gtgtcteccc atgagatgcb tcaggcagtt gtacfctbgca §60 

gtaagaagac ttttragatc acca&soaag gagfttggcgt tgt»^t.t.tot[i tcttgutttc 102D 

tuaatsjctcb gcactcag-ct ctggggggca caaagaagaa aaagaagact attgfcgactg 1080 

atgttttcca ggggtccatg aggatcttca ctaeaaegcb tceccatcct g&tctgcr&o 1140 

CagaaB'waa aaagcatfltg ctccataabg acgagtacca gg-agacaatg gtggagbccn 1200 

cbtttatgta cctgacgctg gaccttccta cbgccccccb ctacaaggac gagaaggagt: L260 

ftgptc-6tc.nl. Lceceaagtg ^cactcttca acatc^taflc baagttcaat ggcatcac-tg 1320 

agaaggaata. taagactbac aaggagaact ttctgaagcg cttccagctt accaagttgc L380 

ctccatatiit aatcttttgt atcaayatfat tca^tnieigsa caacttcttl uttgagaana L34Q 

abccaactab tgtcaatbtc cctatbacaa abgtggatcb gagagaabac ttgtctgaag 1500 

aagtacaagc egtBcacaag aataccaxct estgacctcBt tgtvjaacatc gtgCtttg^ny 156Q 

gcassrecctc csiRQaiJcbcc taccgqatcc acgbcrcttca tcatgggaca ggcaaatggb 1620 

atgaattaca agacctccag gtgactgaca tccttcccca gatgatcaces ctgtcegagg 1680 

ctt(4C^tr.i-<a gatttugaag aggcgagata atunttfaAac caaccagcag ggggcttsraa. 1740 

ggaggcgtcb agggcttbgc tcccaagggc tgtggctgat gatggtaaat aagaacacag ia00 

aagctgtagc tgeacacagg ttggfitggtg ggcittcctag gccagcccag pbtgtatgua IB 60 

Lbctasjctac accagsgcac caagagccca cbtgcctggg atggccccac actgtcactc 1920 

agctgttctt tgatcatbtt tttctagatt gatgctcctt tctcccatgc attgagcti^c 198 0 

fi^ttJtagc-tC cagcagggca gaacccttct ccagatgtgfc gtaactbatg tcttgagtab 2040 

cbgggagtag ttgaagaaea gataattcct tccaaacatc aagccttggg attcttggag 2100 

caagcagaaa gccagtii^t; t r.Kijc!r.<:tgCt agftggtgtfag uni,t?;tcct» tggttccccc 216D 

catttccfcga tbtgtatbtt fcagatggabt aaatagtcbc ctgtctttaa accaaaaaaa 2220 

aaaaa 2225 



<210> 30
<211> 3287
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<223> Incyte ID No: L^bl37 6CBl

<400> 30

f;»gifC!Jcgt.t- esocoggcutg ct^cgcsicgij 

gggaggagga agcggtggct gctgcggatg 

ggc^gr.tgcrc gagcgcetga cccgggc^tg 

cacacccgcb acggtggccc Cgcgcccgtt 

gctgtccagg cctgccugot ctggtgCr:rg 

tgctbgcctg ngtcaccgcc gccgccgccg 

gcggtccbcc gggctccx^tg g6tagcgr:cg 

ccgcbgcctc cgcggacgcc aaaatcatga 

ngy«^ttcg<: cgtgccogag «»t«gctuotr 

gtttbaaatc acatactgac caacttgtgt 

anyatacctf", gautcascaL ggaattcabg 

cacaaaacag gcctcaggat cattcagctc 

ctacatfato aactcctaat agtaactcba 

gtttaggtgg ccttggggga cttgcaggtc 

tcbcbgaaul aoagagbcag atgcagcgac 

agatcatgga aaatcccttt gtt^agagca 

fcaatbaLgiJy caabccacaa atgcagoagt 



ygcc^lccsaa gsrsacgtggg ggoagggjjca 60 
toggtgtgag cgagcggcgc ctgaacacac 120 
ogo<N6ti3ai3C-C: tucaccgatjc tccggg^ccc 180 
gctactgagg cggcgtgctc tgcattc-ttc 240 
CtijgotccK; ctLuctwcc tgcfcccctoc 300 
ccacagccat ggccgagagt ggtgaaagcg 360 
c^cgarjocga aut?t.yc:tanc gcccccgrigg 420 
aagbcaccgt gaagaccccg aaggaaaagg 080 
toc&gcfttftt tftaggaagaa atctctaaac 540 
tgabatbtgc tggaaaaatt ttgaaagatc 600 
atMacttac bgbbcacctb gtcatbaaaa 660 
agcaaacaaa tacagccgg^ agcafetgtta 720 
catctggttc tgcbacbagc aacccbtttg 780 
tgegtagctt gggtttgaat aotapcnft^t B4D 
aacbttbgtc taccccbgaa atgatggtcc 900 
tgctctcaaa tcctg^cctg «tgauar,nijb 960 
bqcibaca^aa aaabccagaa atbagtcata 1020 





tly 


11* 




His 


Asp 


Val 


Thr 


Thr 










650 










Trp 


Arg 


Leu 


Leu 


Thr 


Pro 


Gly 


Asp 


Tyr 










6.6S 










GLu 


Gly 


Tyr 


His 


Ser 


Val 


Thr 


Arg 












680 










GLu 


Gly 


Pre- 


Phe 




Cy£ 


Asn 




Val 










695 










Gin 


Arg 


Leu 


Arg 


Glu 






AJa 


Mb 








710 










Asp 


Leu 


Arg 


Arg 


Arg 


Leu 


GLU 


Rrg 


Leu 



72S 



<210> 28
<212> DNA

<213> Homo sapiens






Ala 


Trp 


GlY 


13 ly 


AEp 


Tyr 


65b 








661) 


Met 


Val 


Thr 


Ala 


ifer 


Ala 


670 










67b 


Cyt; 


Ajrg 


val 


Thr 


Phe 


Glu 












D3V 


Lau 


Thr 


Lye 


Thr 


Pre 


Lys 


700 










70S 


aiy 


Ala 


LyB 


vsj 


Pro 


Pro 


72S 










720 


Arg 


Gly 


Gin 


Lys 


ABp 





7 JO 



<221> misc_feature

<223> Incyte ID No: OSJ371SCB1



<400> 28 

tgaaggacCt tbccasraracc caagdeoaca cactasjaagt ettsjeasctg aagggaggca &0 
ctcctbggcc tccgcagccg atcacatgaa ggbggtgcca agtcbcctgc tctccgtcct 120 
ccbggcacag gtgtggctgu iwaccogSCtt ggCnc-CcttCrt cr%Ucagtoyc cassgacccc IPO 
agcccctceig aaccagacca geagggtagt gcaggctccc agggaggaag aggaagaCga 240 
geaggaggee agcgagifaga agij^eijgtga gyaagagaaa gcctgg^tga tgcfce«gu-wg 3DO 
yeagcagebt gecaaggaga cttcaaactt eggabtcatge ctgcbgcgaa agatctccat 360 
gaggcacgat ggcaacatgg Lctbctctcc atbtggcatg tccttggcca tgacaggctt 420 
gestgoCgutJS gecscaggg c cg;icbgaaac ccapatcaag ayacagctcc acttgeagge 460 
ccbgaagccc accaagcccg ggcbcctgcc ttccctcttt aagggactca gagagaccct 540 
ctcccgcft^c otg^ajx^KQU gofttctOrtCa gsjgu^J tf.tr. gocttcatec aca«gs«ttt (500 
ttjat^bcaeiE! gagacttbet tcaafcttatc caagaggtafc tttgatacag agtgcgcgcc 660 
tatgaatttt cgcaatgcct cacaggecaa aeggctcatg aatcatta^a ttaacaaaga 72 D 
^^Usgsgsi aaaaLtccca aacbgtttga tgagatLaat cctgaaacca aatbaabtcb 79 0 
tgtggattac atetegttea aagggaaatg gttgacccca tttgarcctg tcttcaccga 84 D 
agt<:gacect f.toc^ctgu acaaglacaa gar>c«tt«ag rjtavc ca t g* tgt«cugCi}« 900 
agg<rcagtbt gcetccacct btgacaagaa ttttcgttgt catgbccbca aactgcccta !)60 
ccaaggasat gccaccatgc tggr.ggtect retggagaaa a f.fjijgtgat l: ac^tcrfocet LD?D 
t pnagac Lac ctgaccacag actbggtgaa gaeabggebe agaaacabga aaaccagaaa 1080 
catggaagbt ttctbtccga agtbeaaget agatcagaag batgagetgc atgagebget 1140 
r.»tjgO!9gati3 ggaateaEjaR gaabcttctc accctttsct gaccbtagbg aacbctcagc 12 00 
fcacbggaaga aabctc-caag tatccagggt tttacaaaga acagbgatbg aagbbgatga 12 60 
aeggggcecr. gaggotttftgrf emgyaatett gtca£f*it™*tt icUgcttatt CC»tgcctc:c 1^20 
fcqtcabcaaa gtggaccggc catbtcattt catgatctat gaagaaacct ctggaabgct 1380 
cctgtttctg ggcaggr/tgg cgaatccgac r.ct.cot.fiir.aa ttcaggacat. gc?fet^6g^»a 10 40 
cLLcsbgcba tagtagatgc bgaafcetgag gtatcaaaca cacacaggat accagcaatg 1500 
gatggcaggg gagagtgtbc cbtbbgttct taacbagtbt agggbgtbet ca&fiib.ftaa.ta 1560 
O«0t!*otncc cactbatcbp agiiaggabac atfccaaagac ccccagcaga bgcetgaaac 1620 
ggtggacagt gebgaacebt ababatabtt tttcctacac atecatacct atgateaagb 16HQ 
b-t!Sattt(i>C.*ft aattaggcac dgbaagagat taacaataat aacaacabta agtaanatga 1740 
gttacbtgaa cgcaagcact gcaataccat aacagtcaaa ctgattaLag agaaggctac 1BD0 
t-.ftsgtgactc atgggcfragEf rtg^atacaca gtgtgfrnnac atLgggcaag gggagaattc- I860 
acabccfcggg tgggacagag caggacgatg caagattcca tcccartact cagaatggca 1920 
rgccrjobtaa gacr.lMUaga KKntKt&ttl ctggaaU I' Ut tO«Lttaatg tttttp^aco 19^0 
atggttgacc atggbtaact gagactgeag aaagcaaaac catggataag ggaggactac 2040 
bacaftaafjea tta^jaC lent aoatattttt taaa^ia^aaa 20^0 



<210> 29 
<211> 2225
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<223> Incyte ID No: 114tSlCRl.








Ser 


Gly 


Leu 


Glu 


Asp Gly Asp 


JJL.U 








155 








Glu 


Glu 


Gin 


Asp 


Ala 


Asp 


Pro 


Trp 










170 








Pro 


Thr 


Arg 


Phe 


Ser 


Gly 


val 


lie 










185 








Trp 


Arg 


Tyr 


Abp 


Trp 


Val 


Thr 










200 








Asp 


ser 


Arg 


Thr 


Trp 


Trp Gly 


sex 








215 








Asp 


Ala 


Val 


Phe 


Pro 


Ala 


Asn 


Ser 








210 








Asn 


Leu 


Leu 


Pro 


Glu 


Pro 


Gin 


V3t 1 










245 








Pro 


Gin 


Thr 


Trp 


Leu 


Gin 


Gly 


Gl y 










260 








lie 


Leu 


Ala 


Cys 


Pro 


VaL 


ser 


Asp 








27S 








Ala 


Pro 


Ala 


Ser 


Gly 


Ser 


S*r 


APP 










290 








ftsn 


Tyr 


Lys 


Ala 


Mat 


Arg 


Lys 


LeU 








30S 








Cye 


Pro 


Asn 


lis 


Thr 


Arg 


7.1* 


Tyr 








320 








Gly 


Leu 


Lys 


Leu 


Tyr 


val 


Met 


GlU 








335 








}Ti B 


Glu 


Leu 


Gly Glu 


Pro 


Glu 


va i 










350 








Gly 


Asn 


Glu 


Ala 


Leu 


Gly Arg 


Glu 








3t>5 








Phe 


Leu 


Cys 


His 


Glu 


The 


Leu 


Arg 








3$D 








L^U 


L«*U 


9$r 


Glu 


Met 


Arg 


lie 












355 








ABp 


Cly 


Tyr 


Glu 


T].<* 


Ala 


Tyr 


Ti 1 s 








410 








Trp 


Ala 


Glu 


Gly Arg 


Trp 


A en 


Asn 








325 








ABn 


Phe 


Ala 


Asp 


Leu 


Asn 


Thr 


Pro 


















Gly 


Lys 


Val 


Pro 


HLS 


He 


val 


Pro 






0S5 








Thr 


Tyr 


Tyr 


Thr 


Leu 


Pro 


Asn 


Ala 






070 








Ala 


Vflil 


lie 


Ly£ 


Trp 


Met 


Lys 


Aru 










065 








AST] 




His 


Gly 


Gly 


G1U 


Leu 


Val 










500 








Thr 


J\ry 


Thr 


Pro 


Trp 


Al* 


Ala 


Arij 










5L5 








Asp 


Ala 


Ve 1 


Phe 


Arg 


Trp 


Lai] 










530 








Leu 


AlB 


Met 


Gin 


Asp 


Thr 


Sf*r 


Arff 










545 








Phe 




Val 


His 


Gly 


Asn 


I J.e 


1 1 c 










560 








Val 


Pro 


Gly 


Ser 


Mot 


Abti 


Asp 


r 11 fi 








575 








Pha 


Glu 


val 


Thr 


Va ] 


Glu 


Leu 












590 








Asn 


P. |u 


T.i£lU 


Pro 


Gin 


G1U 


Trp 


t~i i i "i 

lo 1 U 










605 








Thr 


Tyr 


L«U 


GlU 


Gin 


Vttl 


Aru; 


Met 










620 








Asp 


Lyk 


Asp 


Thr 


G1U 


L*U 


Gly 


lie 








635 











105 










150 


Tyr 


Asp 


Gly A La Trp 


Cys 


Alfc 




160 










165 


Phe 


Gin 


Val 


Asp 


Ala 


Cly 


His 




175 










180 


Thr 


Gin 


Glv 


Ser 


Asn 


Sftr 


val 




190 








1J5 


Tyr 


Lys 


val 


Gin 


Phe 


Ser 


Asn 




205 










210 


Arc 


Asn 


His 


Ssr 


ser 


Gly 


Met 


22fl 












Asp 


Pro 


Glu 


Thr 


Pro 


val 


Leu 


235 










240 


Ala 


Arg 




lie Arg 


Leu 


Leu 




25D 










255 


Ala 


t'ro 




Leu Arg 


Ala 


Glu 




2 6i 








270 


Pro 


Asn 


Asp 


Le*u 


Phe 


Leu 


Glu 




2SU 








2*5 


Pro 


Leu 


- 

A&p 


Phe 


Gin 


His 


His 




295 








300 


Mst 


Lyr.! 


\? J.J1 


Val 


filn 


GlU 


Gin 




310 










315 


Sftr 


11* 


v l y 


Lys 


S*ir 


Tyr 


Gin 




325 










330 


Met 


Ser 


Asp 


Lye 


Pro 


Gly Glu 




340 










"1 A R 


Arg 


Tvr 


val 


Ala 


Gly Met 


fir b 
















Leu 


Leu 


Leu 


Leu 


Leu 


Me t 


ri n 

u dlj 




37D 












Glv 


Asn. 


Pro Arg 


*; 7_ i 
1 


TlJT 


Arg 


385 












Leu 


LeU 


Tre* 


£er 


we l 


A&n 


P-rn 




000 










405 


Arg 


Gly 


Ear 


Glli 


LOU 


Vesl 


Gly 




4 15 










420 


Gin 


Ser 


lie 


Asp 


Leu 


Asn 


His 




430 










435 


Leu 


Trio 


Glu 


Rla 


Gin 


Asp 


A Bp 




445 










4 SO 


Asn 


His 


HiB 


Leu 


Pro 


Leu 


Pro 




4&0 










465 


Thr 


Val 


Ala 


Pro 


Glu 


■fhr 


Ars? 




075 










4 SO 


lie 


I'r-o 


Phe 


Val 


Leu 


Ser 


Ala 




090 










49S 


Val 


Ser 


Tyr 


Pro 


Pho 


ASV 


MSt 




505 








510 


Glu 


L**u 


Thr 


Pro 


Thr 


Pro 


Asp 




520 










525 


Thr 


Val 


Tyr 


ai* 


Gly 


Ser 


Asn 




535 










5dD 


Arg 


PtXj 


Cys 


His 


sur 


cin 


Asp 




550 










555 


Asn 


Gly 


Ala 


Asp 


Trp 


His 


Thr 




565 










570 


Ser 


Tyr 


Leu 


His 


Thr 




Cys 




580 










585 


Cys: 


ft«:p 


Lys 
cm


Cys 


Lys. 


He 


Leu 


Glu 


Arg 


Clu 


Ci)y 


CUU 


GJJ) 


Ast) 






655 










700 










705 


Glu 


A13 


L&u 


His 


Lya 
710 


Mfet 




Ala 


Aim 


Clu 
715 


Oin 


Lys 


Thr 


Ser 


Leu 
720 


Pro 


Asm 


TiftU 


Ph*> 




Aep 


Lya 


Asn 


Arg 


Pro 


Cys 


Leu 


Ser 


Asn 


Trp 










725 








730 










735 


Pro 


Glu 


Asp 


Thr 


Afip 


val 


Leu 


Tyr 


He 


val 


Ser 


Gin. 


Phe 


Phe 


Val 








740 










745 










750 


Glu 


Glu 


Trp 


Arg 


Lya 


Phe 


Val 


Arg 


Lya 


Pro 


Thr 


Arg 


Cys 


Ser 


Pro 






755 










760 










765 


Val 


Ser 


Ser 


val 


Gly 
?ia 


A an 


Ser 


Ala 


Leu 


Leu 
?7b 


Cys 


Pro 


His 


Gly 


Gly 
780 


Leu 


Met 


Phe 


Thr 


Phe 
785 


Ala 


Ser 


Met 


Thr 


Lys. 
790 


Glu 


Asp 


Ser 


Lys- 


Leu 

795 


11© 


Ala 


Leu 


lie 


Trp 
BOO 


Pro 


Ser 


Glu 


Trp 


Gin 
805 


Met 


lie 


411) 


Ly* 


Leu 
810 


PI)** 


V«l 


Val 


ASP 


Hi* 


Val 


11* 


Ly* 


11« 


Thr 


Arg 


1 le 


Glu 


Val 


Gly 








815 










820 










825 


Asp 


Val 


ABtt 


Pro 


ser 


Glu 


Thr 


Gin 


Tyr 


He 


Ser 


Glu 


Pro 


Lys 


Leu 








630 










B35 










84 D 


cyB 


Pro 


Glu 


CyB 


Arg 


Glu 


Gly 


Leu 


Leu 


Cys 


Gin 


Gin 


Gin 


Arg 


Asp 






845 










H5D 










855 


Leu 


Arg 


Glu 


Tyr 


Thr 


Gin 


Ala 


Thr 


He 


Tyr 


Val 


His 


Lys 


Val 


Val 






660 










a 65 










870 


Asp 


Ash 


Lys 


Lya 


Val 


Met 


Lys 


Asp 


Ser 


Ala 


Pro 


Glu 


Leu 


Asn 


Val 




675 










BSD 










635 


Sex 


Ser 


Ser 


Glu 


Thr 


Glu 


Glu 


Asp 


Lys 


Glu 


ftLU 


Alu 


Lys 


Pro 


AJ.'P 










890 








855 










900 


Gly 


Glu 


Lys: 


Asp 


Pro 




Phe 


A»n 


cm 


Il<! 


M<it 


Hi ft 


Ala 


Ph* 


Sor 








905 










910 










915 




Ala 


pro 


Pfte 


Aap 
920 


am 


A.an 


lieu 




lie 
925 


Asp 


Gly 


Lys 


He 


Le-u 
J3-C- 


Stir 


Aep 


Asp 


Cys 


Ala 


Thr 


Leu 


Gly 


Thr 


Leu 


Gly 


Val 


He 


Pro 


Glu 








935 










940 










»45 


Ser 


Val 


lie 


Leu 


Leu 
950 


Lys 


Ala 


ABg> 


Glu 


Pro 
955 


He 


Ala 


Aap 


Tyr 


Ala 
9KQ 


Ma 


Met 


A Bp 


Asp 


Val 


Met 


Gin 


Val 


CyB 


Met 


Pro 


Glu 


Glu 


Gly 


Phe 






965 










97n 










975 


Lys 


Gly 


Thr 


Gly 


Leu 
980 


Leu 


Gly 


His

















<210> 18

<211> 227

<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 197387SCD1

<400> 18

Met Gly Asn Cys Val Gly Arg cm Arg Arg Glu Arg Pro Ala Ala






<400> 53

trccbggbbgg gccgogbcct tcfcgctyccc ca^jjccgatg cagacctcgc cgcgagccgc 60 
aggtgtcctg gagbctggcc caggaccbgg ccccacagga gtcccagcag gggtagctcc 120 
bcccgggac-a agg-accgaag tgcgacggbc agtngttc«5 t^cccotgcc tacbgsagca 180 
aaaggaagt^ atcctbcatc tacacc^cag agggbcccca accgcctgab ccacgagaag 240 
b.eewcabacc fccctacaaca tgcctacaafc cctgCcruact gdtafccccbg- egaagfeggaa 300
gccbtcgaca aggccaggea gg&aaacaag ccgattttcc tcbcagtcgg gtactccacc 360 
bgccwcbggb gccacafcirafc gpaagaEjaaa t^cttecaga *tg*gg« S ai; tqijccg-cctg 420 
ctcagtgagg actttgtgag bgtgaaggba gaccgtgagg. agcggcctga cgtggacaag 480 
gtgt&Ciafcga eutCcgtflOa ssicc«c<sagi> agcggegggg gcbggoeab gaatgtgtgg 540 
ctgactccca acctccagcc ctttgtcggg ggcacctatt tccctcctga ggabggcttci 600 
accf:g*sgtcg gcttttegcftf, 1 agr.tjr.tgetg agaatacgfig aacagtggaa acagaacaag 660 
aacacccbgc fcagaaaatag ccagcgbgbc accactgccc tgctggcccg atcagagatc 720 
agcgtgggtg accgc^agct gccgccctct gccgc-caccg tgaacaatcg ctgcttecag 780 
cagctggatg agggcbatga tgaagaatac aybggctfccg ctgaggcccc caagbttccc 840
acgccggbga tccbgagctb cctgttctcc tacbggctca gccatcgact gacCcaggab 900 
cgcbctcggg cccatfcoyat cgcctbgcat acoctsjaaaa tgatgEjctaa. cg-sgg-BCalc 960 
cgggaccatg tggggcaggg cbtbcaccgc tacbccacag accgccagbg gcacgtccct 1020
oacbttgasja. aDatgotf-'ta i,gaoc;»agca casjotcgttg tgg^cbnttr; gcagg^tto 1080
cagctcbctg gtgatgaatc ccactcbgac gtggccaaag gcatcctgca gtaogbggcfc 1140
oggageotga <jcn:a ccggf.c cggaggcttt: tatagegcag asgatgesga ct.cgcc ccca 1200 
gagcggggcc agcggcccaa agagggcgcc bactatgtgt ggacggtcaa agaggbtcag 1260 
cftgctcx'teo cggagcctgb gttgggtgcc accgagccgc tgacct^agg ccagctcctc 1320 
atgaagcacb acggcctcao agaggctggt aacatcagcc ccagbcagga ccccaagggg 1380 
gagctgcagg gccagaatgt gctgac-cgbc cggbactc-gc tggagctgac bgcbgcccgc 1440
bttaacbtgg atntcgagBC eg tfjcggRcin ttgctcaittt cagtfBCtgga CTBagrstcbtc 1500
caggcccgga agcatcggcc caagccgcac ctggacagca agatgctggc Cgcctggaat 1560
ggcttcatiiCT tgtc-aggxt* tgctutyact ggggctgr.oc tgggr^aaga c^ggcCr/atc 1620
aacbatgcca ccaatggtgc caagttccbg oagcggcaca tgtttgatgt ggccagtggc 1680 
cgectgatffc ggaccbgcta cacoggcect ggggggactg tggagcacag ceacccaccc 1740 
bgcbggggcb tccbggagga cbacgcctbc gtgg LsctfEjg gcctjjctgga ccbatatgag 1800
gecbcacagg agagtgcgtg gctcgagtgg gcbctgcggc tgcaggacac acaggacagg 1860
ctctttbgijg acbcccagyg bCTocnflctav ti'-cttfcatfty aggctgagcb ggggsJctgge 1920 
ctgcccctgc gtctgaagga cgaccaggat ggagcagagc ccagcgccaa btccgbgbca 1980
gcjccacaatre; tgctecgg^f. gofctisgctbi* acgggccaca aggacbggat ggacaagbgt 2040 
gtgbgcctab tgaccgcctt bbccgagcgc atgcctcgtg tcccggtggc gtbgcccaag 2100 
atggtccgc-g ccc tec cage ccagcagcag ace ctcaagc agategtgat c-tgtggagac 2160 
ogtcapgeca apga^a^caa ggcccbgcfcc cagti3ogl.ee wctnUabctw cattcct«ac 2220
aaggtgctga ttctggctga tggggacccc Ccgagcctcc tgtcccgcca gcbgcctbCc 2280
otd'fQ t^o-csc- tc<:ga<sgi5tt ygaaa*w*g gecatitgeab atgr.gtgr.g* ga^t^aagci; 2340 
bgcbcagtgc ccatcactga bccctgcgaa fctacgaaaac tactacabcc atgactgccc 2400 
<:aa^eecctt: ggggtggggc agaaggtgae gcatcccaac tgactag^ga ctcaggccct 2460 
gcagggcccb atagaacctg bggccatccc tqagcaccct cccaccaggt gaccbcaacc 2520 
atactcactg ccccccttgg gcac-ccactc accctagaat saacttaacs gtgtc-ccgtg 2580
gtnaaaoasa etaeiaaaa 2597 

<210> 54 
<211> 2385 

<212> DNA

<213> Homo sapiens 

<221> misc_feature

<223> Incyte ID No: 55B3922CB1 

<400> 54

gaecgacccg ccacccgecg bagcccgcgc gccccbggca ctcaatcccc gcc-atgbggg 60 

ggcbccbgcb cgccctggcc gccttosicgc cjjgccgtcgg cccqgcbctg ggggcgccc-Q 120 

ggaacrtcgijc gcbggg^etii gc-gc-agcccg ggaccac-caa ggtcccaggc tcgeccccgg 180 

cccbgcatag cagcccggca cagccgccgg eggagacage taaegggace tcagaacagc 240 

Mt.iiCcciggfet t^gagf-csat.^ aagaesgaaaa aggtcattat gaeg^^gcag 6^gaag<:ta.a 300

ctcbaactcg ccccacccca ctggtgactg ccgggcccct tgtgaccccc actccagcag 360 

ggs<?r;ctc;aa ccccjgotsag ^acs&ag^aa cragy^ bgr.ro teef brgg^t wt.gaaifCorjc 420

tgegagctte agatagcegg cttgaggcat ccagcagcca gtccttbggt cttggaccac 480 

nccuauuacg ncti;aacabt oac toay n no tggaggnegg ogfti:ntatat uauugagoyt 540

ggtgtgctga ggagcaggac gccgabccat ggtttcaggt ggacgcbggg caccccaccc 600 

gcbtytogoy bgbtaCtisca cacyUcnota aotctgr.cbg g«ugCaCu«o tg<jub.rjat:«t 660







natacaaggt 
gtggijatgUa 
tgccggagcc 

tccbtoaggc 

acagcabtgg 
ctogtfeia.jcfc 
acjcccctgpg 
gagggaaccc 
tga^ccctija 
agggccgctg 
cactgtggsJa 
catbgcccac 

tggbgtccta 
carX'aiiatCa. 
tgcaggacac 
bcaacggggo 
ccaactgctb 
bgccccaggs 
t'jaucabtgc 
etgccgtgga 
bgctgacccc 
ggsactgtcg 

goaggcgccb 
gggcaggctg 






cgcagtattt 
ccaggtggcc 

ccctgcgtcg 
ga&Ecta«tg 
gaagagctac 
tgay c tag eg 
gogggsgttg 
acgggtgacL- 
Cggctabgag 
gaacaacc-ag 

tcactacacc 

cccattcgac 
tgul.gl-gttt 
cagccgccga 

tgaggtcacb 
gCgggagaac 
ag^iciCTtggtg 
tgggattaac 

ggtcaccttb 
cag'jotgctfc 
ggageggcta 
garctgtofea. 
tca.ttaaa.gc- 



aatgacag-tc 
cctcccaatt 
cgcttcattc 
3agnL.ee t-gg 
ggatcctctg 
aagcaggtai? 
cagggectga 
gagecbgagg 
cttctgctcc 
eggctgetet 
atcgcctacc 
egcatcgaCc 
gatgggaagg 
cbgcccaabg 
coctLbgtgf? 
Jtgacbcgca 

ccctgccaca 
aogsjLcccog 
gtggagctgt 
sacaflagsctf 
agggacaagg 
catgacgtga 
atggbgactg 
gaagagggec- 

aggggacaga 
c gygaagg 
tac-cgggcac 



ggao^bygr.g 
ca-gacccaga. 
gcctgctgcc 
cctgcccajjt 
acccbctaga 

agctgtatgt 
Lgcyctaoyt 
tgatgcagtb 
ctgagatgcg 
aceggggfcte 

tgccccacat 
ccaccgtggc 
taactgccaa 
ccccgtgggc 
gceu'tgtcta 
gecaggaett 
gge*s3catg»Q 
cctgbgacaa 
C^CtC'OtOftC 

acaeggaget 
ccacggcgtg 
ccagtgccgo. 
ccttc ccctg 
caBcbggggc 
aggabtgata 

gg««gai3^RS 
cttaaaaaaa 



aacbecagtg 
cragacot.gg 
ctcagac-ccc 
ctttcagcab 
ccccaacatc 
gatggaaatg 
Cjgnfcgscatg- 
CDtgtgccab 
ca.ttc»cctg 
agagctggtg 
tr.ttgirtgac 
cgtcc-ccaac 
tccbgasa^g 
cc-tccacggg 
tgcccgcgag 
tgcbg^caqt 
ctccgtgca-c 

gttccctcac 
cr.*<:ctgr/ag 
tgggattgcb 
ggg^ggggab 
gggctaccat 
caabttegtg 
caaasitgccc- 
cctgcggttb 
agagggaaog 



cbgaacctcc 
ct-ccaggg3g 
aalgacctat 
cacaabtaca 

teggacaage 
cattfygafeeg 
gagttcctgc 
cl-gccctcca 
ggctgggccg 

catcacctgc 
eggscautaa 
ggtgagctcg 
ctc.&cgcc.ca. 
aatctggcca 
ggc-aacatca 
tacctacnca 
gagaabgaab 
cagiitgc^ca 
gaegebgtea 
tattggt;rjtr 
tcagtgacac- 
cbc^c-caaga 
ccggaccttc 
aagageccta 
ac-aaag-bgag 



720

780

840

900

960

1020

1080

1140

1200

1260

1320

1380

1440

1500

1560

1620

1680

1740

1800

1860

1920

1980

2040

2100

2160

2220

2340
2385